CHAPTER 1 


Data presentation and collection 


Objectives 


Introduction 


When you have read this chapter you 

should be able to: 

► construct tally charts and assemble 
data in the form of frequency tables; 

► know the difference between discrete 
data and continuous data; 

► present a visual display of frequency 
tables in the form of equal and unequal 
width histograms; 

► use the histogram as a basis for 
constructing frequency polygons and 
frequency curves; 

► develop and interpret cumulative 
frequency tables and their associated 
diagrams; 

► present and use various visual 
approaches involving bar charts, pie 
charts, Lorenz curves, etc., which 
are widely used as a means of 
communicating patterns and trends 
in data. 


Statistics, as a subject, is a study of the various tech¬ 
niques of giving meaning to untreated or raw data. In 
this chapter we consider some simple, visual techniques 
which will be helpful in identifying patterns and trends 
in data. These will often involve drawing diagrams such 
as histograms, frequency curves, bar graphs, pie charts, 
Lorenz curves, etc. However we start by considering some 
simple procedures of data collection involving tally charts 
and the presentation of the data collected in a more 
'grouped' form using class intervals and frequency tables. 
Ideas which involve selecting samples from all the data 
available and interpreting the results from sample data 
are considered in Chapter 7. 

Answers to the 'Self-check questions', responses to 
each 'Pause for thought' and answers to the 'Review 
questions' can be found at the end of the book (pp. 
320-458). 


1.1 Frequency distribution 


Frequency is the number of times a certain event has 
happened, and is often found by means of a tally chart. For example, if you were 
conducting a survey of defective items produced by machines of different ages 
(in months) in an assembly workshop, you might record your findings as shown 
in the tally chart of Figure 1.1. 

Each time a defective item is identified as coming from a machine of a 
particular age, a vertical line is drawn, and should a fifth such lint bt needed a 
diagonal line is drawn through the previous four vertical lines. At the end of the 
data collection period the total number of tally marks are added for each age 
of machinery and the total placed in the end column marked F for frequency. 
The total frequency (80) is found by adding these individual (class) frequencies 

together. 






2 quantitative 

figure 1.1 

Tally chart for 
defective items 
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Age in months 

Tally 

Frequency (F) 

0 and under 5 

II 

2 

5 and under 10 

III 

3 

10 and under 15 

Wt H 

7 

15 and under 20 

Wit llll 

9 

20 and under 25 

mthtt wuw 

20 

25 and under 30 

WtWtt Wft II 

17 

30 and under 35 

wtwtwuw II 

22 

Total 


80 


1.2 Frequency tables 

Figure 1.1 is essentially a frequency table, and it will be useful at the outset 
to define a number of terms often encountered when discussing such tables. 

► Class intervals: these are the groups into which we organise the data. 

There are seven class intervals in Figure 1.1, the first being '0 and under 
5' and so on. 

► Class boundaries, these are the values at which the different class intervals 

exa ” 1 Pl e the second class interval has a lower class boundary 
( CB) of 5 and an upper class boundary (UCB) of 10. 

^hown^theb-tT 316 and u pper values of the class interval as 

the class boundaries ^However thh^ ^ ^ daSS UmitS are the Same 35 
ajre^™ be the case. Suppose 

Marks Frequency 


0 to 9 
10 to 19 
20 to 29 
30 to 39 


5 

6 
14 
18 


The second class interval h ac , 

•able, bu, class boundaries of 97° f 10 and 19 “ shown in the fr 
between class limits, the mid DO b" , o' Whw "'"re a gap in th 

“ ; nt T alS m " et - u - the class bo, 8ap wUI 1,0 the value at wl 

second cl' 6 f " St Class ln,civ al, where" !" y 'l" exnm l’ lc a niark below 
second cl„ ss mterval. '’ Ue " s •> mark of <>. s ot abow g0 

► Class width (or si/. c y thi i 

u::;rrr ,ies ,or ^ ^ im!*rvi? c A between the upper and u 

► 

I bculai class interval. ° of ° bs ervatlons found to occur i 
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WORKED 
EXAMPLE 1.1 


A frequency table is useful in giving an immediate impression of how the raw 
data are distributed. We can see from Figure 1.1 that, although the relationship 
is not precise, the older the machine the greater the number of defective items 
that were found to occur. 

The following steps may help when you try to construct your own fre¬ 
quency table from raw data. 

Step 1 Determine the range of the data: 

range = maximum value - minimum value 

Step 2 Determine the number of class intervals to be shown in the table. 

This is largely a matter of opinion - too many class intervals will 
confuse the data whilst too few class intervals will mean that you 
lose much of the information. Most published frequency tables use 
between five and ten class intervals. 

Step 3 Determine the width (size) of class intervals. 

As a general rule the width of each class interval should be 
approximately equal to the range of the data divided by the number of 
class intervals. 

Sometimes Steps 2 and 3 are best reversed, as when you have 
already decided on an appropriate width (size) for your class interval. 

Step 4 Determine the first lower class limit. 

Step 5 Construct the frequency table using a tally chart. 


The percentage marks scored by 58 candidates seeking promotion in a personnel 
selection test were recorded as follows: 


37 

49 

58 

59 

56 

79 

62 

82 

53 

58 

34 

45 

40 

43 

44 

50 

42 

61 

54 

30 

49 

54 

76 

47 

64 

53 

64 

54 

60 

39 

49 

44 

47 

44 

25 

38 

55 

57 

54 

55 

59 

40 

31 

41 

53 

47 

58 

55 

59 

64 

56 

42 

38 

37 

33 

33 

47 

50 




Construct a frequency table of the data, using class intervals with a class width 
(size) of ten percentage marks. 


Solution 

Step 1 Determine the range of the data: 

maximum = 82; minimum = 25; range = 57 
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figure 1.2 

Tally chart for 
percentage scores 


Steps 2 and 3 Determine the width (size) and number of the class intervals. 

Width of class intervals = 10 (e.g. 20.5 to 30.5). Therefore sev en 
class intervals will be required to cover scores 21 to 90 (per cent) 
should we decide to set the first lower class limit at 21 , as in Step 4 
Step 4 First lower class limit = 21 

Step 5 Construct the frequency table 






Class 

Tally 

Frequency 



21-30 

II 

2 



31-40 

IWJHt 1 

11 



41-50 

wumutni 

17 



51-60 

WUtttWtWt 

20 



61-70 

1 W 

5 



71-80 

II 

2 



81-90 

1 

1 



1.3 Discrete or continuous data 


Data are either 'discrete' or 'continuous'. 


Discrete data 

These are data which can only take on a limited number of different values. 

For example: 

► the number of houses built 

► the number of goals scored 

► the number of marks scored in a test 

All these examples can only take on a limited or finite number of integer (whole 
number) values. 


Continuous data 

These are data which can take on an infinite number of different values (at least 
in theory): 

► your height 

► your weight 

► your age 

► the time taken to do a job 


f D YOU KNOW? 

wre has been a huge increase in the volume ol data 
firm must process. The OECD has estimated that it 
is grown sevenfold in the past 20 years. Despite the 
iwth of IT, some 92 million paper documents involving 
ta of one kind or another are produced each year. 


All these examples can take on am 
value, subject only to the minimum unit 
of measurement available. These values 
need not be integers; example, 0.001 cm 
1.24 grams, 4.03 seconds, and so on.Con¬ 
tinuous data are often ‘rounded ott’ to >< 
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certain number of decimal places even though still smaller measurements might 
theoretically be possible. 


PAUSE FOR THOUGHT 1.1 


Can you give 
of continuous data? 


two more examples of discrete data and two more examples 


1.4 Histograms 


The histogram is a visual means which is often used to display data which 
have been arranged in the form of a frequency table. The histogram can be used 
for displaying both discrete and continuous data. The histogram has a number of 
properties: 

► it is a set of rectangles, each of which represents the frequency of a 
particular class interval; 

► each rectangle is constructed so that its area is in proportion to the 
frequency of the class interval it represents; 

► the base of each rectangle represents the width (or size) of the class 
interval and the height of each rectangle represents the frequency of the 
class interval; 

► when all the class intervals have the same width (or size), then the vertical 
axis which represents the height of each rectangle is the class frequency; 

► when the class intervals are of unequal width (or size), then the height of 
each rectangle must be adjusted where it differs from the 'standard' class 
width, i.e. from the class width of the majority of class intervals. For 
example if the width of a particular class interval doubles (and with it the 
base of the rectangle), then we must halve the height of the rectangle 
used to represent its class frequency. We must do this to keep the areas of 
the rectangles proportional to the class frequencies. The vertical axis is no 
longer the frequency of the class interval but the frequency density, as we 
shall see below. 

Equal-width histograms 

Table 1.1 represents the data from our earlier example of defective items pro¬ 
duced by machines of varying age. 


table 1.1 

Frequency table for 
defective items by 
a ge of machine 


Age in months 

Frequency 

0-5 

2 

5-10 

3 

10-15 

7 

15-20 

9 

20-25 

20 

25-30 

17 

30-35 

22 


SO 
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business and economics 


figure 1.3 

Equal-width histogram 



As can be seen, each class interval has an equal class width (or size) ot tne 
months. This is sometimes called the standard class width. Tire equal-width 
histogram which represents these data is shown itr Figure 1.2. 

Note that the rectangles are connected together, that tire base on tat I' 
rectangle represents the standard class width (five months) and is the saint 
throughout, and that the height of each rectangle represents the tlass In 
quency. As a result the areas of the rectangles are in proportion to the resptttnt 
class frequencies, and so we can see at a glance that the bulk ot the fiequtnt ^ 
distribution is to the right of the histogram, in the rectangles 20-25. 25 -aO an^ 
20-25 months. We gain an immediate visual impression from the histogi* 11 
that the frequency (of defective items) increases broadly in relation to tlrt 4 - 
of the machinery. 
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table 1.2 

Frequency table 
for time taken 
by employees to 
complete a task 


table 1.3 

Frequency density 
for time taken by 
employees to 
complete a task 


Time (nearest minute) 

Frequency 

0-1 

42 

2 

35 

3 

30 

4 

21 

5 

15 

6-7 

15 

8-9 

6 

10 

1 


165 


Time 

(nearest minute) 

Frequency 

Class width 
(minutes) 

Frequency density 

0-1 

42 

1.5 

(42 + 1.5) 28 

2 

35 

1 

35 

3 

30 

1 

30 

4 

21 

1 

21 

5 

15 

1 

15 

6-7 

15 

2 

(15 - 2) 7.5 

8-9 

6 

2 

(6 + 2) 3 

10 

1 

1 

1 


Unequal-width histogram 

Sometimes the data will be such that the class intervals are of different widths 
(or sizes). In the previous example of defective items the width of each class 
interval was five months. In the next example (Table 1.2) however we have a 
frequency table in which the widths of the various class intervals vary, being 
either 1 minute, 1.5 minutes or 2 minutes. 

Remembering that the areas of the rectangles must be in proportion to the 
class frequencies, we must adjust the height of any rectangles in line with any 
variations in the base of that rectangle. So if the base (class width) doubles in 
length we must halve the height (class frequency) and so on. The value on the 
vertical axis is no longer frequency, but frequency density. These adjustments are 
shown in Table 1.3 and Figure 1.3. 

Remembering that the class width is the difference between upper and lower 
class boundaries, we can see that the first class interval has a width of 15 minutes 
and the sixth and seventh class intervals have class widths ot 2 minutes, with all 
other intervals having a class width of 1 minute. The adjusted frequency densities 
from Table 1.3 are shown in the uneqwal-widtli histogiam below (1 iguit 1. . 

The class width which appears most often in an unequal-width frequency 
table and histogram is often called the standard class width. In the above example 
the standard class width is I minute. We derive the frequency density by 
dividing the class frequency by the number of standard class widths in that class 
interval So, for example, we divide the six employees in the penultimate class 

interval by two to give a frequency density ot three. 

This approach can be seen more clearly in the following worked example. 
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figure 1.4 

Unequal-width 

histogram 


WORKED 
EXAMPLE 1.2 



A company wishes to review its distribution operation and from its time sheet 
records it found that 144 vehicles were loaded in a 24 hour period. A frequency 
distribution table was prepared from the data as follows. 


Time to load (minutes) 

Number of vehicles 

40-45 

4 

45-50 

13 

50-55 

17 

55-60 

44 

60-70 

59 

70-80 

7 


From the above data construct a histogram to represent the data. 


Solution 

Step 1 From the frequency table decide on a standard class width ~ 5 minute 
Step 2 Enlarge the original table, adding columns for class width, numbt'i oi 
standard class widths and height (frequency density). 
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Time 

Frequency 

Class width 

No. of standard 
class widths 

Height (frequency 
density) 

40-45 

4 

5 

1 

4 

45-50 

13 

5 

1 

13 

50-55 

17 

5 

1 

17 

55-60 

44 

5 

1 

44 

60-70 

59 

10 

2 

(59 + 2) 29.5 

70-80 

7 

10 

2 

(7-i-2) 3.5 


Note that where the standard class width has doubled, as in the case of the last 
two class intervals, we must halve the height (frequency density) of the 
corresponding rectangle. 

Step 3 Construct the histogram 


figure 1.5 

Histogram showing 
number of vehicles 
loaded in a 24 hour 
period 



1.5 Frequency polygon 


A frequency polygon is a straight line diagram which represents exactly the 
same area (i.e. frequency) as shown in the histogram. The frequency polygon is 
derived from the histogram by connecting the mid-points of the tops of each 
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figure 1.6 

Frequency polygon: 
area enclosed 
identical to 
histogram 


methods for business and economics 



adjacent rectangle by straight lines, as in Figure 1.6. “^^^^horizontal 
to oin the mid-points of the tops of the end two rectangles with the no 

axis, as shown, then a key property of the frequency pol ^ g °” S jjyequal 
become apparent: namely that the area under the histogram is ide Y 

to the area under the frequency polygon. are 

The shaded area is common to both polygon and histogram -Hentical 
eight respective pairs of congruent triangles (side-angle-side), giving i 
areas contained by polygon and histogram. 


1.6 Frequency curve 

From the previous discussion we can therefore state that the frequency ^ 
tribution represented by the histogram can equivalently be represents F ^ 
straight line polygon known as the frequency polygon. Further, if tlie p 
to be connected by straight lines were close together then, as an app r0X |^ ^ jn 
we could represent the frequency polygon by the frequency curve * 

Figure 1.7. Indeed the frequency curve is the most usual way of re P u - 0 f 
a frequency table visually. However, it is basically an alternative nn^ ^ 
displaying visually the properties of a frequency table already captuis 
histogram and frequency polygon. 

The two peaks indicated by the frequency curve would represen 
distribution, as we note in Chapter 2. 


1.7 Cumulative frequency curves 


Cumulative frequency is the 'running total' of the figures shbtyn in tlu * 
quency column of a frequency table. As we shall see we can draw up a “ m * ( 
ive 'more than' and cumulative 'less than' tables to represent data on in tl 
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figure 1.7 

Frequency curve as 
an approximation to 
the frequency 
polygon 


table 1.4 

Frequency table for 
height of trees (m) 


table 1.5 

Cumulative ‘less 
than’ frequency 
table for height of 
trees (m) 



Height (m) 

0-1.9 

2-2.4 

2.5-2.9 

3.0-3.4 

3.5-4.5 

Frequency 

15 

43 

30 

10 

2 


Height (m) Frequency Height less than (m) Cumulative frequency 


0 - 1.9 15 l - 95 

2-2.4 43 2.45 

2.5- 2.9 30 2.95 

3.0-3.4 10 3.45 

3.5- 4.5 2 4.55 


15 

58 

88 

98 

100 


wealth and many other variables. We can then draw cumulative frequency 
polygons or curves to give a visual picture of these tables. 

Table 1.4 illustrates the frequency distribution of the height of 100 trees in 
a plantation, each tree being measured to the nearest 0.1 m. 

We can, for example, produce a cumulative 'less than' frequency table (lab e 
1 5) This keeps a running total on how many trees are 'less than' a particular 
height As we saw with histograms, the width (size) of the class intervals (groups) 
are 8 slightly wider than the table suggests. For example 2-2.4 is really 1.95 - 

heiuht < 2 45, i.e. a class width of 0.5 (m). 

We now draw a cumulative frequency curve from this table by ptott.ng the 

cumulative frequency (on the vertical axis) against the 'less than value (on the 
horizontal axis). Here, we |oln up the points we have plotted with a smooth 
curve. This frequency curve is also known as an ogive . te crnvt p o tt in 
Pieure 1 8 is therefore the cumulative 'less than' curve (or ogive). 

You'can take note of some useful Information from such a curve, tor 
, all but two of the trees (i.e. 98 out of a hundred) have a height less 














12 QUANTITATIVE METHODS FOR 
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than 3.45 m. On the other hand, 
1.95 m in height. 


only 15 out of a hundred trees are less than 


PAUSE FOR THOUGHT 1.2 


Table 1.5 bat asing the lower c/assn™^ !ha "' freciuenc >' table using the ban 

c/ass boundaries for each class intenral. 


We can also use our cumulative 'less than' fr 

involving percentiles. A percentile • ^ fret l uenc y curve to answer ques 

given percentage of the distribut/ 5 a P art * Cu * ar vnlue below (or above) wh 
For example, 'what is the 1 \ • 

tribution will lie below that heS'V? *. for whlch 25 per cent of tin 

(nr fiist quartile'), shown as 2 is\ • US is t,le lower quartile hi 

frequency is I()(>, then 25 per cent .?,? FigUre KS - Note that since the 
u continuity correction (see box belowT * fU , S t * 1 ' s case - However becau 

. H ° Wer (|uar,ile observation let W ° S loultl stric tly regard 25.25 tre 
first, second (median) and third". Z C ° nsWer the continuity correction 
estimate those values. '^pectively, before using Figure I 
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Continuity correction 

szzTZzzr make siight adjustmen,s when <■■»«»• 

► lower (first) quartile position — (n + 1) 

4 

► median (second) quartile position - (n + l) 

► upper (third) quartile position — (n + 1) 

where n is the number of observations. 


It is perhaps helpful to explain the reason for this continuity correction by 
starting with the second quartile, the median. 

Median 

The median (second quartile) value is that value for which 50 per cent of the 
data are below it and 50 per cent above it. If we arrange the items of data in 
order (whether rising or falling in value), we need to find the value correspond¬ 
ing to the middle item. For example, with nine items, the median value will 
correspond to that of the fifth item arranged in order, i.e. the value 7 in the 
example below. 


2 4 4 5 (7) 10 11 14 18 


Note that the median position is given by — (n + 1) = - (9 + 1) = 5th item 

2 2 


arranged in order. However when the number of observations, n, is not an odd 
number but an even number, then the median value will lie mid-way between 
two observations. For example, with ten items as a result of adding the number 
20 to the previous nine items, the median value will correspond to that of the 
5.5th item arranged in order. 


median position 

2 4 4 5 7 t 10 11 14 18 20 

5.5th item 

Notice that the median position is given by i (n + 1) = — (10 + 1) = 5.5th item 
arranged in order. 

1 

We can therefore say that the median position is given by — (/) + 1), where 

this expression is known as the continuity correction. 

In the previous example with 100 trees, the median item in the cumulative 

frequency table is the —- (100 + 1) = 50.5th item. 

We can use our cumulative frequency curve (Figure 1.8) to find the median 
height of trees. We draw a horizontal line from 50.5 on the vertical (cumulative 
frequency) axis to the curve. We then draw a vertical line from the curve to the 














' et of 
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horizontal axis and 'read off the value i.e. around 2.4 m. So half the 
trees are below 2.4 m in height, and half above 2.4 m rn height. 

We follow a similar procedure for finding the rst and third quartile position 

as we do for the second quartile (median). However instead of I (i , e 2* 

1 3 ^ 

have - and - respectively. 

4 4 


PAUSE FOR THOUGHT 1.3 


_ At what value would you expect a cumulative ‘more than’ frequency curve 

to intersect a cumulative ‘less than' frequency curve? 


Lower quartile 

The lower quartile (first quartile) is that value for which a quarter (25 per 
cent) of the distribution lies below it. If there are n items in the distribution, 

we need to find the value corresponding to the — (n + l)th item, when the 

items are arranged in order. In our 'tree' example, n = 100 so — (r? + 1) = ^ 

= 25.25. 

We can therefore use our cumulative frequency curve to estimate the value 
of the lower quartile. This time draw the horizontal line from 25.25 on the 
vertical (cumulative frequency) axis to the curve, and read down to around 2.15 
metres on the horizontal axis. So a quarter of the trees are below 2.15 metres 
in height. 


Upper quartile 

The upper quartile (third quartile) is that value for which a quarter (25 per 
cent) of the distribution lies above it. (We therefore look three-quarters of the 
way up the distribution, arranged in order, where we find the upper quartile). 
If there are n items in the distribution, we need to find the value corresponding 

to the ^ (n + l)th item, when the items are arranged in order. If our tree 
example, n = 100 so ~ (100 + l) - 303 „ 75 75 

freauencvTa vk f T n ° W ““ aCr0 “ 4 from 7575 on the vertical 

reading of around a " d * hen down «° the horizontal axis. This 

height (i e three „ f m ' 0 0ne ' quarter of the ttees are above 2.65 metres m 

We can t the * ° f are 2 « metres in height). . 

distribution. Here ^ the llei ^ hts of trees at any percentile point of t u 

quintiles (20 per cents) ZrZ^T qUartiles (25 P er cents ^- but we couId haV0 ^ 
n f [ p cents) or Reties (10 per cents) 

round! Si? ^ CUmulative than' distribution the other ^ 

axis. For example we C in°"' i° * lorizontaI axis an d then moving to the v" “■ 

a height less than a snedfi- K ^i'* number (here percentage) of trees ha" - 

height less than .1 C Va ue> For sample 90 trees (90 per cent) have * 

Chapter 2. ,s niedian and lower and upper quartih- s 
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SELF-CHECK 

QUESTIONS 


1.1 The frequency table below shows the number and value of transactions at a 
corner shop taking place between the hours of 17.00 and 22.00. 


Value of transaction (£) 

Frequency 

0-3.99 

8 

4-7.99 

18 

8-11.99 

22 

12-15.99 

24 

16-19.99 

13 

20-23.99 

12 

24-27.99 

2 

28-31.99 

1 


Construct a histogram of the data. 

1.2 Here is the frequency table of the weekly income of 45 people in a survey. 
Draw a histogram to represent these data (note: unequal-width histogram). 


Weekly income (£) Frequency 

76-100 

6 

101-125 

8 

126-150 

17 

151-200 

8 

201-300 

6 


1.3 The following table shows the distribution of the ages of people attending a 
public concert. 


Age range (yrs) 

0-19 

20-39 

40-59 

60-79 

80-99 

No. of people 

8 

26 

110 

128 

56 


Construct a cumulative frequency (less than) table and draw the 
corresponding curve on a graph. Use your curve to estimate: 

(i) the median age attending the concert, 

(ii) the lower and upper quartile ages, 

(iii) the percentage of people over 65 attending the concert. 

1.4 (a) Complete the cumulative ‘more than’ table below: 


Height (m) 

Frequency 

Height more than (ni) Cumulative frequency 

0-1.9 

15 

0.0 

2-2.4 

43 

1.95 

2.5-2.9 

30 

2.45 

3.0-3.4 

10 

2.95 

3.5-4.5 

2 

3.45 



4.55 
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(b) Draw the cumulative ‘more than’ curve on Figure 1.8. 

(c) What do you notice about the point at which the cumulative ‘less than’ 
and ‘more than' curves intersect? 


Note: Answers can be found on pp. 320 323. 


1.8 Bar charts 


There are three main types of bar chart: 

► simple bar chart 

► component bar chart 

► multiple bar chart 

Simple bar chart 

A simple bar chart is a set of non-joining bars of equal width whose height 
or length is proportional to the frequency it is representing. They can be drawn 
in a vertical or horizontal format and can show negative, as well as positive, 
values (see Figure 1.9). 


Component bar chart 

A component or segmented bar chart is useful to illustrate a breakdown 
m the figures, e.g. the total sales of XYZ Co. could be broken down into sales 

by product (see Figure 1.10(a)). The constituent parts of each bar are always 
stacked m the same order with the height nf ™ i . , 

values or frequencies. 8 °' re P re *™>n« the mdrv.dual 


figure 1.9 

Simple bar charts 


300 T 
200 
too 
0 


VERTICAL 
Annual rainfall 


HORIZONTAL 

Budget variance by department 


~ y e variance 


+ ve variance 


1999 20«0 2001 1 
years 


10 



transport 



storage 



sales 

3 


personnel 



computer 

r- 


0 


0 


10 


Percentage of budget 
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figure 1.10 

Component and 
multiple bar charts 


U) 

O 

O 

o 

<4 


160 
140 
120 
100 
80- 
60- 
40- 
20 - 
0 


Sales by product 
;||j Product A Q Product C 

| | Product B 


W/A 




Sales by product 


WA 


Product A 
I [^] Product B 


Product C 


1999 2000 2001 

(a) Component bar chart 



1999 2000 2001 

(b) Multiple bar chart 


Multiple bar chart 

A multiple bar chart uses a separate bar to represent each constituent part 
of the total. These bars are joined into a set for each class of data (see Figure 
1.10(b)). The data in Figure 1.10(a) are represented in multiple bar chart form 
in Figure 1.10(b). 


1.9 Pie chart 


A pie chart involves a circle which is used to represent the whole data. Sectors 
of this circle ('pie') are then used to represent the relative frequencies of particu¬ 
lar components of the data. The size of each sector is found by calculating the 
proportion of the whole data represented by the component and then multiply¬ 
ing by 360° to obtain the angle of the sector from the centre of the circle. 

Pie charts give an immediate visual impression of the relative contribution 
of particular components to the overall total. They are widely used in company 
reports and accounts, as in highlighting the relative contribution of ditterent 
product lines to overall profit or of different types of cost to overall cost. How¬ 
ever they are only effective for displaying the contributions ot a relative!) small 

number of separate components. 


WORKED 
EXAMPLE 1.3 


survey of 12 000 television viewers on a particular evening found the following 
ormaLn as to the channels being watched a, 8 p.m. on that evenrng. You are 
ked to construct a pie chart to represent these data. 
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table 1.6 

TV viewing figures 


table 1.7 

Finding the sectors 
of the ‘pie’ 


figure 1.11 

Pie chart for viewing 
figures 


Channel 

Number 

BBC1 

3,000 

BBC2 

1,000 

ITV3 

5,500 

ITV4 

2,000 

5 

500 


Solution 

To find the angle of the circle (‘pie’) represented by each channel, we find the 
proportion of the whole data ( 12,000 viewers) represented by that channel and 
then multiply by 360°. This is done in Table 1.7. 


Channel 

Frequency 

Angle 


BBC1 

3,000 

3,000 

— 1 -x 360 = 

90° 


12,000 


BBC2 

1,000 

1,000 „„„ 
-x 360 = 

30° 



12,000 

ITV3 

5,500 

5,500 

. x 360 - 

165° 



12,000 

ITV4 

2,000 

2,000 

- x 360° = 

60° 



12,000 

Channel 5 

500 

500 

12,000 X 360 

15° 


I he pie chart is then drawn from thic +oki^ 

Thp rnmniPtPH nip ^ • u S tab e usln S a Protractor or angle measurer 
me completed pie chart is shown in Figure i -i-i \m~ , 

ITV3 dominated that evening's viewing Wes ^ 3 6 

share of viewers. 6 gU es ’ Wlth channel 5 having the smaller 
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activity 1 


In actually drawing the pie chart it is helpful to draw the smallest sectors first, so 
that any small inaccuracies in angles will be visually less important when you 
come to drawing the larger sectors. 


Greece is a major export market for your company. You sell a variety of products, 
many of which are age specific. You therefore require a demographic breakdown, 
by age, of the Greek population. Use the information in the following table and a 
spreadsheet with which you are familiar to plot the graphs mentioned. 


Greece: population by age 1999 
'000 


0-9 

1,160 

10-19 

1,425 

20-29 

1,544 

30-39 

1,428 

40-49 

1,263 

50-59 

1,305 

60-69 

1,081 

70-79 

663 

80+ 

331 


1 Construct an ogive (cumulative less than frequency curve) from the data in the 
table above (the first class [0 to 9] is placed in cell A4). 

Remember that an ogive is plotted against the upper class boundary of the 
class interval therefore: put these upper class boundaries in column C (i.e. 
9.5, 19.5, 29.5 .. .). 

Calculate the cumulative less than frequency in column D as follows. 

(a) in cell D4 enter 1,160, 

(b) in cell D5 construct a formula to calculate the cumulative less than 
frequency, 

(c) copy the formula down to cell D12. 

Plot a graph of the cumulative frequency, i.e. the ogive. 

Use the ogive to estimate the population below 25 years of age. 

2 Convert your results into a cumulative percentage less than ogive. 

3 Construct a pie chart from the above data to show the relative importance of 
the age groups specified within the whole Greek population (total for 1999: 

10 , 200 , 000 ). 

4 Comment on your results. 

A response to this activity can be found on p. 459. 


1.10 Lorenz curve 

1 he Lorenz curve is used to contrast the actual distribution of a variable 
rh ^distribution which would have occurred in a situation ot perfect 
equality' The distribution of variables such as Income, wealth, proht, etc, are 
often displayed visually using Lorenz curves. 
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7 


figure 1.12 

Lorenz curve 





51 


As we can see from Figure 1.12, the Lorenz curve uses cumulative percentages 
for each axis; one axis being the cumulative percentage figure for the variable 
(here income), the other being the cumulative percentage figure for the frequent) 
(here people). 


► The diagonal line represents perfect equality; for example 20 per cent of 
all people have 20 per cent of all incomes, and so on. 

► The Lorenz curve itself represents the actual distribution; for example 
20 per cent of all people have only 7.6 per cent of all income, and 

so on ' nt" 

► The vertical distance between the Lorenz curve and the diagonal rep rt -'^ 
inequality; for example the bottom 20 per cent of income earners slu'^ 
have had a further 12.4 per cent of income had perfect equality ex" u 


The Gini coefficient is sometimes used with the Lorenz curve as a n u ‘ 
inequality. It is the ratio of the area between the Lorenz curve and drag 1 
the total area beneath the diagonal: 


of 

to 


► 

► 

► 


If there is perfect equality (i.e. the plotted Lorenz curve is the diag° n ‘ l 

then the Gini coefficient is zero. on tal 

It there is perfect inapiality (i.e. the plotted Lorenz curve is the l u 

axis until the last person who has all the variable), then the t* 1111 

coefficient is one. , >i 

i the 

I he smaller the Ciini coefficient, the greater the equality, anu 
the Ciini coefficient the greater the inequality. 
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SELF-CHECK 

QUESTIONS 


As we shall see from Self-check question 1.6 below, the above conclusions may 
be invalidated should different Lorenz curves intersect one another. In this case 
we may have to trade-off greater equality at one part of the distribution with 
greater inequality at another part, which may render the overall figure of the 
Gini coefficient less useful. 


1.5 Draw a pie chart to display the following data. You can round your result to 
the nearest degree. 


Road accident casualties 2000 

Category Number killed 


Pedestrians 
Pedal cyclists 

Two-wheeled motor vehicles 

Cars & taxis 

Others 

Total 


1.6 (a) Use the following data to draw a Lorenz curve for 1979 on Figure 1.12 
above (p. 20). 

Cumulative per cent of income received by deciles of population. 


Income receivers 1979 2000 


Bottom 10% 

4.7 

3.0 

Bottom 20% 

11.0 

7.6 

Bottom 30% 

17.5 

13.9 

Bottom 40% 

24.2 

20.3 

Bottom 50% 

32.2 

27.5 

Bottom 60% 

41.2 

36.2 

Bottom 70% 

51.8 

46.7 

Bottom 80% 

64.3 

59.2 

Bottom 90% 

79.6 

75.0 

Bottom 100% 

100.0 

100.0 

Gini coefficient 

0.248 

0.337 


1,914 

323 

942 

2,019 

226 

5,424 


(b) Compare the 1979 and 2000 Lorenz curves and comment on any 
changes in inequality. 

(c) Suppose, for 2000, you were to add an extra 10.0 per cent to the 
incomes received by the bottom 60 per cent, 70 per cent, 80 per cent 
and 90 per cent, so that the respective percentage figures become 
46 2 56 7, 69.2, 85.0 (all other figures unchanged). How would this 
affect your Interpretation of income inequality in 2000 as compared 

with 1979? 


Note: Answers can be found on pp. 323-324. 
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REVIEW 

QUESTIONS 



The oldest resident was 95 years old. 

__ Hots mnstruct. 


(a) a histogram of the data, 

(b) a frequency polygon. 


1.2 A survey has been conduct “own ST 

n-^PiiPd to work by the production staff, witn tn 


Distance travelled to work (in kilometres) 


13 1 2 9 27 

4 63 7 2 3 

12 43 6 4 3 

73 26 7 8 2 

48 16 42 3 2 

8 26 49 21 15 

2 53 7 16 21 

57 2 55 18 25 

6 71 8 2 8 

3 4 8 4 3 


The aim of the survey is to investigate the effect on the staff of re oca ^ 
the production facility. The main impact of the move will be detenui ^ 
by investigating the impact on those who travel: 0-5 kilometres, 
kilometres; 10-30 kilometres; >30 kilometres. 

Construct: 


(a) a frequency table of the data, 

(b) draw a histogram of the data. 


1.3 A company wishes to review its distribution operation and trom ^ \ 
sheet records it found that 144 vehicles were loaded in a 24 hour 1 
frequency distribution table was prepared from the data as follows: 


Time to loud (minutes) 


40 4 S 
45-50 
SO ss 
55 60 
60—70 
70 80 


N of vehicles 

4 

14 

17 

44 

59 
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From the above data construct: 

(a) a cumulative frequency curve (ogive); 

(b) from your ogive estimate: 

(i) the median loading time, 

(ii) the interquartile range (the difference between the upper and 
lower quartiles), 

(iii) the number of vehicles loaded in under 52 minutes. 

1.4 The following total expenditure was incurred by a business in the last 
financial year: 


Cost centre 

£ 

Employees 

1,139,410 

Premises 

69,004 

Transport 

52,524 

Raw materials 

83,205 

Admin 

23,242 

Capital financing costs 

71,282 

Total 

1,438,667 


Construct a pie chart to illustrate the data. 

1.5 Whilst investigating the pattern of imports into a target market, a firm 
collected the following data: 


Year 

$ (billion) 

1991 

308.962 

1992 

283.724 

1993 

305.448 

1994 

292.457 

1995 

202.743 

1996 

204.61 

1997 

226.2 

1998 

272.981 

1999 

319.01 

2000 

303.829 


Use a vertical bar chart to illustrate the data. 


1.6 The sales volume (units) by geographical area of three types of product in a 
particular quarter are as follows: 


Model 

North 

South 

Hast 

West 

Export 

Galactic 

4,644 

4,817 

4,055 

4,393 

1,201 

Jupiter 

7,884 

9,735 

6,468 

7,089 

3,067 

Saturn 

6,136 

6,703 

6,497 

5,934 

1,844 


(a) Use a component bar chart to illustrate the data. 

(b) Use a multiple bar chart to illustrate the data. 
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business and ECONOMICS 

quantitative methods FOR bus „ ra nhs/charts illustrate the data in the tab | f . 

1.7 With the use of sui ^_ 

No. of passengers 

(millions) 


Airline . 

Delta 

American 

United 

USAir 

Northwest 

Continental 

CAAC 

A1 Nippon 

Lufthansa 

Aeroflot 

British Airways 

JAL 

TWA 

Alitalia 

SAS 

America West 
Air France 
Air Inter 

Japan Air System 
Malaysia Air 


International 


8.4 
15.2 
11.4 

1.7 

7.6 
3.0 

1.4 

1.7 
16.9 

3.8 
22.0 

9.4 

1.8 
9.1 
9.8 

13.8 

0.1 

0.3 

6.4 


Domestic 

80.5 
65.9 

62.7 

57.8 

37.9 

36.9 

36.1 
32.8 

14.1 

25.2 
5.5 

16.7 

19.0 

11.2 
9.0 

15.6 
1.8 
6.8 

15.1 

7.8 


Source: ICAO 


Note: Answers to Review questions can be found on pp. 376-384. 


Further study and data 


i exts 
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Tn , intema S ! ater '| R ' W " 6) ' ^‘antitattve methods for business decisions. 4* 
Lawson' u L kk na ‘ Thom P“n Business Press, chapters 1 and 2. 

Addison Weskyu 5 ' ““ PU8h ' P ' <199S) - Maths «»<' f "' 

Morris, C (1999f n„ T™' Cha P<“ 1 and 2. 

chapter 5 . Q " mt,ta ‘'ve approaches in business studies. 5th edn, I ltn ‘ 
Oakshott, L. ( 1996 ) p, . n 

Swift » I- (2001), QuantitUr* 1 elemm& °f business statistics. BPP, chapter -• 
|>al Brave Publishers * 1 ^ "'' tll0tls t or business, management and 
lht >'» a sMt.(1997)7)^:J " ,tDn l ~2. 


'6o.nas, '~ 2 - ... 

chapter 1. ll <oti\ ( methods for business studies, Prentice H 1 ' 11, 

Waters, l). (| () 

WisnU*kV*'MwIiIi°st ^ ^ n ^' HUlS ^ bUS ‘ WSS ' 2,Ul edn * Al1diS01 ’ 

suit ss, Pitman, chapters V l oinui atiou quantitative methods fo' 

■' S and 6. 
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you seek to h adS“C'Slowfn?i- y r- reqUlre Wi " depend on the problem 
data you mtght find to, ,h e ^ I o T »f the types of 

statistical sources. Those for the UK P d,K \ lhe rest of the world in various 
National Statistics (ONS). lC main ^ Polished by the Office for 

‘ Ssriszrcxsr t »•——- 

Published |„ , h , A “JJ;* <“1 

source of data for the UK p™™ L ° NS ' * the most im Portant 

► UKBar " 1 the tabl6S USU3lly C0 ™ r 3 ^yearpehod" PKXRted ^ 

Z t'rc'Srr, ,he pL **>■ ^ 

► Economic t ^ Book. Published annually by the ONS 

^ ;r^;r? ed montwy by ,he ° ns - * «* <*. 

Of areas ineHiH 8 K ° Ver ' perhaps - five y«rs. It covers a ranee 

Trends Annual Sunn'ie ""T- employment and tIa de. The Economic 
series of Wh “ Supp, ' ment 15 Particularly useful for obtaining long 

► Regional T "h™ '" ld c|uart '- r ly delta, some series going bark to 1945. 

S rends is published annually by the ONS It inrinH^c 

- down by 

oattern, of h! PU “‘ Shed annuall y b T ,he ONS, contains data on 
da I n household wealth, income and expenditure, together with 
data on demographic, housing and social trends 

“of,!; 5 *™* ° f Sta “ StiCS ' A " ama ' 8am ° f topta are 

► Monthly Digest of Statistics. As the name implies, an amalgam of 

statist.cs published monthly. It covers a wide range of topics, including 
economic, social and demographic. 8 

► General Household Survey. The results of a sample survey of 
households are published annually in the GHS. Five core topics are 

covered - education, employment, health, housing and population and 
family information. 

► Labour Market Trends. A monthly publication covering labour market 
issues and offering detailed information on wage rates, productivity, 
hours worked and so on for various sectors of the economy. 

► Business Monitor MM23. The source of detailed information on prices. 

► Financial Statistics. Monthly publication of the ONS relating to 
financial indices such as interest rates, exchange rates and the monev 
supply. 

► Bank of England Quarterly Bulletin. Published hv the Economics 
Division of the Bank of England, containing a number of articles and 
statistics about the money supply and the financial sector. 

► National Institute Economic Review. Unlike all of the above, this 
is not an official Government publication. However, it does contain 

a statistical appendix which is a very useful compilation of UK and 
international data on a wide range of economic issues. Published 
quarterly by the National Institute of Economic and Social Research. 


CHAPTER 


2 

Central location and dispersion 


Objectives 


Introduction 


When you have read this chapter you 

should be able to: 

► understand and use simple short-hand 
notation widely used in statistics; 

► identify and calculate the various 
measures of central location, such as 
arithmetic mean, median and mode; 

► find measures of central location for 
both ungrouped and grouped data; 

► be aware of the circumstances under 
which the various measures of central 
location will be similar and under which 
they will differ; 

► identify and calculate the various 
measures of dispersion, such as range, 
interquartile range, mean deviation, 
variance and standard deviation; 

► find measures of dispersion for both 
ungrouped and grouped data; 

► understand and interpret measures 
which combine aspects of central 
location and dispersion, such as the 
coefficient of variation. 


We saw in Chapter 1 that the use of tables and charts 
can help by giving a visual impression of patterns and 
trends in data. Meaning can also be given to the original 
(raw) data by calculating various measures of central loca¬ 
tion (average) and dispersion (spread of data around the 
average). 

Answers to the 'Self-check questions', responses to 
each Pause for thought' and answers to the 'Review 
questions' can be found at the end of the book. 


21 Notation 

At least three tvnes r>f 

seeking to desrrm * age are in common us 

median and mode e Th aW f data ' namdy arithmetiC 

(and indeed elsewh he /° rmulae used in theirder 
with Which vn u } make USe ° f a tyP e 0f n< 
y shou ld become familiar. 

Greek letter sig ma = sum of 


variable which arp t , When applj ec j t 

and above the siem° e SUmmed ar e indir Variab,e the values 

throws of a dice ^ 1 S ‘ Sn ' For example ll Ct by the numbers imposed 

throws, we would" Wt ‘ Wish ■« "m a * refers to the scores fn 

Wr te: ‘ (ackl Aether) the scores on the fin 


+ Xj \ x ( 

" 3 + 5 f * (say) 


28 
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Other possibilities might include summing the scores on the second, third and 
fourth throws of the dice: 

4 

X^/ = X 2 + X 3 + X 4 

i = 2 

= 5 + 1 + 6 (say) 

More generally, when we wish to sum over n values of the variable we write: 

n 

X*, = Xi + x 2 + x 3 +... x„ 

/ = 1 

where n can be any number (here the number of throws of the dice). 

Sometimes a particular score on the dice might occur once or more than once; 
in other words that particular score might occur with a frequency Fj. For n 
throws of the dice we might obtain / different scores. We would then write the 
total score as being given by: 

j 

X^,. = F i X i + F 2><2 + F 3 X 3 +. . . FjXj 

/ = 1 

where / is the number of different scores observed in the n throws. 

For example, suppose we are faced with the following results from 12 throws of 
the dice: 

<3 

5, 2, 6, 6, 2, 1, 6, 3, 5, 3, 2, 1 

As chance would have it only five of the six possible scores on the dice have 
appeared (no scores of 4 obtained) in the 12 throws. Here j = 5 and n = 12. 

If we let: X x = 1, occurs with frequency F x - 2 
X 2 = 2, occurs with frequency F z = 3 
X :i = 3, occurs with frequency F 3 = 2 
X 4 = 5, occurs with frequency F 4 = 2 
X 5 = 6, occurs with frequency F 5 = 3 

Note that if we sum the individual frequencies over the five different scores 
observed we get: 

X^ =F i+ F 2 + F 3 + F 4 + F s 

i = \ 

5 

£/•; = 2 + 3 + 2 + 2 + 3 

i -I 

£/? = 12 

i = I 


So the sum of tin* individual frequencies for each of the five different scores 
equals the total number of observations, n, which in this case equals 12. 
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SELF-CHECK 

QUESTIONS 


AND economics 

methods for BUSI 

As already mentioned w 

earlier formula: 


only have 


five different numbers, so / = 5 


in 


our 


5 

2 

»=i 


F,X S + ft*. * FsXs 

3 ) + (2 x 5) + (3 x 6) 

_2 + 6 + 6 + l0+l^ 


^X-F.X, +F 2 X 2 + 

= (2 x 1) + (3 x 2 ) + (2 * 


lF i X i = 42 

i=1 f this shorthand notation involving the sigma 

You will come across the use 0 subsequent chapters. It will therefore be 
sign throughout this chaptei^an^ £ any father 

helpful to practice using the notan 

2.1 Expand (write out fully) each of the following expressions: 

(a, IX, (h, IX, Mg* MSW 

Itl 1=2 ,=1 


2.2 Write 


out each of the following expressions using sigma notation: 


(a) X ± + X 2 + X 3 + X 4 + X 5 

(b) X 5 + X 6 + X 7 

(c) FjXi + F 2 X 2 + F 3 X 3 + F 4 X 4 

(d) F 2 X 2 + F 3 X 3 + F 4 X 4 

2.3 The following scores are obtained for 15 throws of a dice: 
5, 6 , 4, 3, 6 , 5, 1, 1, 2, 4, 3, 1, 2, 4, 6 


Find jF'X, 

1=1 

where j = the number of different scores observed 
F, = frequency of a particular score 
X, = particular score 


Note: Answers can be found on p. 325. 


?.".^.. IV,eaSUreS ° f central location 

A number of measures of is.- , ifl in 

attempt to give meanine to , ‘ . cat,on or 'average' are widely used u ‘ 

median and mode for both , data ‘. Hcre we c °nsider the arithmetic mea • 

grouped data (items only identifieT^i individual items known) ' l,u 

V ictentified within class intervals). 
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The arithmetic mean 

This is the simple average of everyday use, and is often represented by the 
symbol X. 

Ungrouped data 

Where individual or ungrouped data are available, the following formu a is 
commonly used: 

n 

I*. 

- — 
n 

X - arithmetic mean 
Xj = value of each item of data 
n = number of items of data 


X 

where 


WORKED 
EXAMPLE 2.1 


Suppose the daily takings of a small corner shop are as shown below: 

Mon. Tue. Wed. Thur. Fri. 

Daily sales £620 £660 £600 £480 £710 


Solution 

Using the formula: 


x = ,=1 


ri 

X L +X + X 3 +X,+X S 620 + 660 + 600 + 480 + 710 


= 3 ’°I2. - £614 
5 

The average daily sales are £614 and this represents a typical figure around which 
the rest of the data will cluster. 

Mo^mifalhcTs we noted in Chapter 1, data are grouped into a frequency table 
•fh various class intervals. To deal with such data the simplifying assumption 

any liven Cass Man* the items of data fa.. the 

™ass midpoint This is equivalent to assuming .ha, the items o, data ate evenly 

spread within any given class interval. 

We may now make use of the following formulae: 




X = 


t F > 

/ = 1 


F t Xy + F 7 X 2 + . ■ • Fj Xj~ 

F x + F 2 + — F t 
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where F, = frequency of ith class interval 
X, = mid-point of ith class interval 
j = number of class intervals 


WORKED 
EXAMPLE 2.2 


Suppose that a survey of the prices of 60 items sold in a shop gives the 
below (five class intervals for prices): 


results 


Price of item (£) 

Number of items sold 

1.5-2.5 

15 

2.5-3.5 

2 

3.5-4.5 

19 

4.5-5.5 

10 

5.5-6.5 

14 


table 2.1 

The average 
(arithmetic mean) 
price of items sold 


Solution 

We can use our formula for grouped data to calculate the average (arithmetic 
mean) price of the items sold (see Table 2.1). 


Price of items (£) 


1.5- 2.5 

2.5- 3.5 

3.5- 4.5 

4.5- 5.5 

5.5- 6.5 


Class mid-points 
X, 


2 

3 

4 

5 

6 


5 



/=1 


Number of items sold 

F, 


F,X, 


15 

2 

19 

10 

_14 

5 

= eo 


30 

6 

76 

50 

84 


Jf,X, = 246 


In other words the 


average price of the 


items sold in the shop was £4.10 pence. 


DID YOU KNOW? 

The mean time for a person speaking each day is /„ c 
than 12 minutes, and for a doubling of nan,nr w 
held on file by businesses is less than oc ument& 

31 years (despite IT!) 


mating 


The median 

Ibis is that value which divides the data 
set into two equal halves; 50 per cent of 
values lying below the median, and 50p tM 
cent of values lying above the median- 
llte approach we consider below u> 1 
’e median value for both grouped a ,H 


arithmetical methods tor esti 

ungrouped data. 

















CENTRAL LOCATION AND DISPERSION 33 


Ungrouped data 

To find the median: 


(i) Construct an array (i.e. place the data in numerical order - whether rising 
or falling). 


(ii) Find the median position 


— where n = number of values. 
2 


(iii) Find the median value. 


WORKED Find the median value of machine output over a five day period: 

EXAMPLE 2.3 



Daily output 


310 

340 360 320 

330 


Solution 

Step 1 Place in an array 310 320 330 340 360. 

~ n "H1 

Step 2 Find the median position using the equation where n = the number 

5 + 1 

of values. Median position = — = third item. 

Step 3 Read the value of the third item in the array, i.e. 330 units. 

Note: Where there is an even number of items there will be two middle items. If 
so, take the average of these two middle items. 


ThTfoHcwing^steps can be used to calculate arithmetically the median value for 
data presented in a frequency table: 


► 

► 

► 

► 


id the median position. 

id the class-interval in which the median observation lies 
sume that all items in this class-interval are equally spaced. 

timate the median. 


WORKED 
EXAMPLE 2.4 


Find the median 


height of students from the data given 


in Table 2.2 (p. 34). 


Solution 


n + 1 

Median position - - 

20+1 

2 

= 10.5 
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The class interval in which this median position lies is 170-175. 


table 2.2 

The height 
of students 


Heights (cm) 


150 and 
155 and 
160 and 
165 and 
170 and 
175 and 
180 and 
185 and 


under 155 
under 160 
under 165 
under 170 
under 175 
under 180 
under 185 
under 190 


Frequency (number of students l 


1 

1 

2 

3 
6 
2 

4 

_1 

20 


The median value can be found using the formula: 

number of observations to median position 


LCB + class width x 


total number of observations in median class interval 
Where LCB = lower class boundary (of median class interval). 

Median = 170 cm + (5 cm x —) 

6 

Median = 172.9 cm 


m ed?„ U h e eS. g ' VeS 3 ViSUa ' PiCtUfe °' th6 USe ° f *>"»«■ «ing ,he 


figure 2.1 

Median class 
interval for table 2.2 



w/vw! of f 170 cm an<j 9n upper class rr faiis has a iower c,ass b ° un 

of 10.5 students. * POsitio " which occurs* SeVen 

- with a cumulative frequ 
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Clearly the median value is larger than 170 cm as we wish to go a further 
3.5 students (observations) to reach the median position. However there are six 
students assumed to be equally spaced along this class interval of 5 cm. We 
therefore wish to go 3.5/6ths of the way along the 5 cm interval in order to find 
the median height. This gives the formula: 

3 5 

170 cm + (5 cm x —— ) = 172.9 cm which we noted above. 

6 

We have already noted in Chapter 1 (p. 12) how the use of cumulative frequency 
curves (or ogives) can help find the median value by graphical methods. 


The mode 

This is that value which occurs with greatest frequency. When data are grouped, 
then the class interval with the highest frequency is referred to as the modal 
class interval. In Table 2.2 above the class interval 170-175 cm is the modal 
class interval, with six students in the interval. 


2.3 Normal and skewed distribution 


When the set of data is distributed in a perfectly symmetrical way, as in Figure 
2.2(a), then all three types of average have the same value. Such a symmetrical 
distribution is often referred to as a normal distribution. However when, as 
is more usually the case, the set of data is skewed in one direction or another, 
as in Figures 2.2(b) and (c), then the three types of average will cease to be 
identical. In fact the arithmetic mean will always be most heavily influenced 
by the direction of skew, i.e. the direction being described by the tail of the 
distribution, which represents the side of the distribution with fewest observa¬ 
tions. In other words, the arithmetic mean will be most affected by a few 
extreme values, whether higher values (skewed to the right) or lower values 
(skewed to the left). 


figure 2.2 

Normal and skewed 
frequency 
distributions and 
•measures of central 
■ocatlon 



Median (X) 
Mode 

;a) Normal distribution 


Variable 
(X,) 


cca> variable 



(b) Positively skewed 
distribution 
(skewed to right) 


(c) Negatively skewed 
distribution 
(skewed to left) 
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, to the rieht or left, you must be careful i n 
When data are skewed to the "gn / * 


When data are skewea ro me u 6 --' “ inter¬ 

pret^ any loose talk of 'average' values. For example if the distribution of 

wages In a firm is skewed to the tight, as m Figure e employers mlght 

claim an 'average' wage of, say, £180 per week and the trade union an 'ave, agf 
wage of, say, £160 a week and both might be right! However the employ 
may be selecting the type of 'average' (arithmetic mean) most favourable,« 
showing them as a high paying company and the union may be selecting the 
type of average (median) most favourable as a basis for pressing for higher 
wages. 


PAUSE FOR THOUGHT 2.1 


In what direction would you expect the UK income distribution to be 
skewed? How might this affect median and mean incomes? 


WORKED 
EXAMPLE 2.5 


The following distribution shows the weekly output of production workers in the 
company: 


table 2.3 

Weekly output of 
production workers 


Output (units) 

Number of employees 

100-160 

1 

160-180 

5 

180-200 

10 

200-220 

35 

220-240 

55 

240-260 

74 

260-300 

20 


200 


(i) Find the arithmetic mean of the weekly outDut 

(ii) Find the median weekly output. 

(iii) Why do (i) and (ii) differ? 

Solution 

(i) Since we are dealing with rr^. 

variable X,. We then use the formula^ 9 ^ mUSt take class mid 'P oints for th<? 

i/;*, 

x = ^L~ 

if. i 

i -1 

where F, = class frequency 

K = class mid-point 

j = number of ph C e ■ * 

1 c,ass intervals 
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X, 

(class mid-points) 

F, 

(class frequency) 

F,X, 

130 

1 

130 

170 

5 

850 

190 

10 

1,900 

210 

35 

7,350 

230 

55 

12,650 

250 

74 

18,500 

280 

20 

5,600 


7 

7 


£f,= 200 

YfiXr 46,980 


i-1 

/ = 1 


if,*. 


X = — 


If. 

1=1 


46,980 

200 


= 234.9 


The arithmetic mean of the weekly output is 234.9 units. 

n + 1 201 


(ii) The median position 


= 100.5 employees. 


2 2 

The median class interval for output is 220-240 units, as shown in Figure 2.3. 


figure 2.3 

Finding the Median 
Value 


100.5 



,1 ina nur formula for the median, and assuming an equal spacing of the 55 
eXeJtnTthe median Cass interva, of 20 units of output, we nave: 


LCB + class width x 


number of observatio ns to median position 
-1 T _place infprv? ‘ 




i.e. 


/on ^ 49-5 ) = 238 units 
Median = 220 + (20 x 55 
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r 


(iii) The arithmetical mean output (234.9 units) is lower than the median „„ 
(238 units). We would expect this to be the case since the data are * ' 
skewed to the left. It is similar to Figure 2.2(c) and the arithmetic mean 
(simple average) will be pulled down by the few extremely low values. 


PAUSE FOR THOUGHT 2.2 


Before calculating the solution to Self-check question 2.6 below, look 
briefly at the table of data and decide on the direction of skew and the likely 
impact this will have on the mean and median values. 


SELF-CHECK 2.4 Passenger cars produced for export (X,) in thousands in each month during 

QUESTIONS the past year were as follows: 


69.3 75.5 76.3 94.4 88.7 77.2 86.4 75.5 79.0 83.3 82.2 78.1 


(a) What is the value of X , when: 

(i) / = 4 

(ii) 1=7 

(iii) / = 10 

(b) What is the mean monthly production of cars? 

(c) What is the mean monthly production of cars over the final six months? 

(d) What is the mean monthly production of cars during months 4-9 inclusive? 

2.5 The following figures show the annual salaries in £s of 20 workers in a 
small firm. Calculate the arithmetic mean, median and mode salary and 
comment on your results: 

15,180 19,870 14,375 15,767 15,870 15,180 14,375 36,938 

15,180 46,132 15,525 19,600 14,375 23,069 16,767 16,767 

17,880 14,375 14,375 14,375 

2.6 An investment analyst receives the following table of data showing the 
percentage changes in labour costs of senior managers in a multi national 
company over a 12 month period. The highest change observed was +172 
per cent for a senior manager In one of the plants in Latin America. 


Percentage change Frequency 


-5 to under 
0 to under 
5 to under 
10 to under 
15 to under 
20 to under 
25 to under 
30 to under 
35 to under 
40 to under 
100 to 


0 

2 

5 

32 

10 

25 

15 

10 

20 

8 

25 

3 

30 

2 

35 

5 

40 

4 

100 

3 

172 

4 
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(a) What is the mean annual percentage change in labour costs for senior 
managers in this company? 

(b) What is the estimate of the median value? 

(c) What is the modal class interval? 

Note: Answers can be found on pp. 325-328. 


2.4 Measures of dispersion 


As well as measures of central tendency or average, it is helpful to have a 
measure of the extent of dispersion or spread around that average. The for¬ 
mulae and brief outlines for the most widely used measures of dispersion are 
presented here. 

As we can see from Figure 2.4, two distributions showing the prices of items 
sold in a shop may have similar measures of central location but may be very 
different in terms of dispersion around any 'average'. Clearly distribution B is 
more widely dispersed than distribution A around the same arithmetic mean X. 

► Range Perhaps the simplest measure of dispersion is to take the absolute 
difference between the highest and lowest value of the raw data. In 
Figure 2.4 the range for distribution A is £30, but the range for 
distribution B is higher at £50. 

► Interquartile range This is the absolute difference between the upper 
and lower quartiles of the distribution. We saw how to calculate these 
quartiles in Chapter 1, p. 12. 

Interquartile range = upper quartile - lower quartile 


figure 2.4 

Same mean (X), 
different dispersion 
for two distributions 
A and B 
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siness and economics 

quantitative methods for range This is half the interquartile range, i, ls 

► Seroi ' inter r afcdthe quartile deviation. 

sometimes upper quartile - lower quartile 

Semi-interquartile range = 2 

„f dispersion considered so far have involved comparing „ 
All the measures of ctive frequency distributions, such as the ma< 

different points on range ) or the upper and lower quartiles ( the 

imum and mimmu P ^ ( (um w more use ful measures of dispersion which 

interquartile range). respective frequency distributions, 

compare all the points on the respect 


Mean deviation 


DID YOU KNOW? 

The mean income for the top fifth of UK households is 
more than 20 times higher than that for the bottom fifth 
of UK households, before the impact of the tax and 
benefit system. After the impact of taxes and benefits, 
the respective mean incomes only differ by a factor of 
less than three. 


MD = £—11 1 4 ~ 3 1 + 13 - 31 |-1| + 

3 = - 


Mean deviation with ungrouped 


data 


More generally MD = 


Sl*i - X\ 

i i 


where 


I - modulus (| e icdnrt 

"~- = nUmber ot observations S ' en) 


This is the average of the absolute devia¬ 
tions from the arithmetic mean, ignoring 
the sign. Of course if we did not ignore 
the sign, the average deviation from the 
arithmetic mean would always be zero! 
When two straight lines (rather than 
curved brackets) surround a number or 
variable it is referred to as the modulus and means ignore the sign. 

The mean deviation is only rarely used as a measure of dispersion. How¬ 
ever it is worth some consideration as it is the basis for the more widely used 
measures of variance and standard deviation. 

Ungrouped data 

Suppose three throws of a dice yield the following values: 

X, = 2, X 2 = 4, X 3 = 3 

Clearly the arithmetic mean, X, is 3. To find the mean deviation (MD) ^ 
need to find the average deviation from the mean, ignoring the sign: 

MD = l X f ~ ^| + \ X 2 ~ X| + |X, - X| 
n 


l+H + 101 = 2 

me average sco," of° 3 . °" tlUee throws <* the dice is f of a unit away W 














CENTRAL LOCATION AND DISPERSION 41 


Grouped data 

If we return to our example of the prices of 60 items bought in a shop 
(Table 2.1), we can now use the mean deviation to give a measure of dispersion. 
We noted (p. 32) that the average price (X) was £4.1 per item sold. We now 
apply the formula for mean deviation with grouped data: 


Mean deviation with grouped data 

tw - x| 

MD = —- 

& 

/ = 1 

where £ = class frequency 
X, = class mid-point 
X- arithmetic mean 
j = number of class intervals 
| | = modulus (i.e. ignore sign) 


PAUSE FOR THOUGHT 2.3 


How might you expect the mean 
to change over time? 


deviation for the age of the UK population 


WORKED 
EXAMPLE 2.6 


Find the mean deviation for the data shown in Table 2.1 (p. 32). 


Solution 


Price of item Class mid-point 
(£) Xi . 


1.5- 2.5 

2.5- 3.5 

3.5- 4.5 

4.5- 5.5 

5.5- 6.5 


Note: X = 4.1. 


Number of items 

F, 

\x,-x\ 

F,\X,-X\ 

15 

|-2.1| 

|-31.5| 

2 

1—1.11 

| -2-2 | 

19 

|-o.i| 

1 -1-91 

10 

1 0.91 

I 9.01 

14 

1 1-9| 

| 26.6 | 

60 


71.2 


S w - *i 

md = -^-t- 

1*5 


71.2 

60 


1.19 


The average deviation from 


the arithmetic mean price is £1.19 per item. 
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Variance 

»- •» 

»h. i». »■•»“ i”" 1 " “ ,l ” J >»«« Cft* 

11 ci 1^1 round brackets. 


the 


more usual round brackets. , 

The resulting measure is now called the vanance, which can , hen 

regarded as the average of the squared dev.attons from the mean. Note , n * 
th! units are squared, e.g. square pounds, square metres, etc. ’« 


Ungrouped data 


Variance for ungrouped data 

£<*, - *> 2 

Variance (s 2 ) = —- 

n 


WORKED 
EXAMPLE 2.7 


Suppose we have to find the variance of the following eight items of raw data. 
8, 10, 12, 14, 16, 18, 20, 22 


Solution 


n 


X = 

Yx, 

h _ 120 

n 8 

= 15 

X, 

(X, - X) 

(X - X) 

8 

(-7) 

49 

10 

(-5) 

25 

12 

(-3) 

9 

14 

(-D 

1 

16 

( 1) 

1 

18 

( 3) 

9 

20 

( 5) 

25 

22 

( 7) 

49 



168 


Bx, - x) 2 

s 2 = hi __ _ 168 _ 

n 8 = 21 sc l Uar e units 


Varian ce (s 2 ) = o -| 


SQyare units 


Grouped data 

the formula for 


calculating the 


variance when data 


are grouped is as 


S follow 
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Variance for grouped data 

Xw - x ) 2 

Variance (s 2 ) = —- 

t F ' 

/=i 

where F, = class frequency 
Xj = class mid-point 
X = arithmetic mean 
j = number of class intervals 


However it can be shown (see p. 47) that this formula simplifies to the 
following: 


Variance for grouped data: operational formula 


1F,X? 


Variance (s 2 ) = — 


7 * 

t F > X ' 

i=l 


If, 

i = 1 

hxj 


IF, 

1.1 1 


Variance (s 2 ) = 


2\ - iiL 


IF, 

i=1 


-(X) 2 


This is a useful formula for finding the variance since we will already have found 
the second term when calculating the arithmetic mean. 


WORKED 
EXAMPLE 2.8 


Let us apply this formula to our earlier problem involving prices of items sold in 
Worked Example 2.6. 


Price of item 
(£) 

1.5- 2.5 

2.5- 3.5 

3.5- 4.5 

4.5- 5.5 

5.5- 6.5 


Class mid-points 
X, 


2 

3 

4 

5 

6 


Number ot items 



F, 

FXj 

15 

30 

60 

2 

6 

IS 

19 

76 

304 

10 

50 

250 

14 

84 

504 

60 

246 

1,136 


s 


2 


S F,X? 

1?, 




7 
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s 2 = 


1,136 ( 2461 = 18.9 - (4-1) 2 


60 


60 


Note: £fvX? = F,Xl +F 2 Xl + --- F l X f 

•„ nniv on the X, term and not on F,X,. 

In other words the square is only on me , 

We can obtain the column F,X? by either multiplying the F,X, column by the 
corresponding X, or by multiplying the column by the correspond.ng X?. Check 

this vourself in the table above. 


Standard deviation 

Rather than use square units, it is more realistic to express solutions in terms of 
single units. The standard deviation does this by taking the square rootol 
the variance. 

The standard deviation (s) is then the square root of the average of the 
squared deviations from the mean. The following formula applies to both 
ungrouped and grouped data: 

Standard deviation = ^variance 
i.e. s = ^ 


WORKED We can use our earlier data on 20 student heights to illustrate the calculation of 

EXAMPLE 2.9 both the variance and the standard deviation as measures of dispersion. 


Solution 


Heights (cm) 


150 and under 155 
155 and under 160 
160 and under 165 
165 and under 170 
170 and under 175 
175 and under 180 
180 and under 185 
185 and under 190 


Number Mid-points 



1 

1 

2 

3 
6 
2 

4 

_1 

20 


152.5 

157.5 

162.5 

167.5 

172.5 

177.5 

182.5 

187.5 


X = 12_ 

8 

If, 

i l 


3,445 

20 172.25 ent 


F,X, 

FjXf 

152.5 

23,256.25 

157.5 

24.806.25 

325 

52.812.50 

502.5 

84.168.75 

1,035 

178.537.50 

355 

63.012.50 

730 

133,225.00 

—18ZJ5 

35,!5&25 

3,445 

594.975 
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S 2 



8 \ 2 


/=1 

8 

/=! J 


s 2 _ 594,975 
~20 


3,445 

20 


\2 

) 


S \ ~ 29,748J5 ~ 29,670.06 
S ~ Varia nce = 78.69 square cm 
s = Standard deviation = V 78^69 
'•e. s = 8.87 cm 


ACTIVITY 2 


potential of C0mmiSS ' 0ned b V a manufacturing company to examine the 

demoerlhv of r 6 martet Pa,t ° f ' he investi « a « 011 requires work on the 
the follow' ^ , GreeCe ' Use a spreadsheet with which you are familiar to derive 
he following information and data. Here the formulae and functions are given 
in terms of Microsoft Excel. 


1 Preparation 

Type the following data and headings exactly as shown below 


A 


B C D 


1 Greece: Population 1986-1999 

2 

3 Area (sq km) 


4 

5 

6 


7 

1986 

8 

1987 

9 

1988 

10 

1989 

11 

1990 

12 

1991 

13 

1992 

14 

1993 

15 

1994 

16 

1995 

17 

1996 

18 

1997 

19 

1998 

20 

1999 


131,908 

population 

9,642,505 

9,729,350 

9,789,513 

9,846,627 

9,895,801 

9,934,249 

9,963,604 

9,983,490 

10,004,401 

10,038,672 

10,088,700 

10 , 200,000 

10,311,300 

10,422,600 


% change 


Pop. density 


2 Calculate the percentage change In population 

(a) Create a formula in C8 ( not 01 ) to give the percentage change of 
population from 1986 to 1987. 

(b) Copy this formula down the column in order to calculate the percentage 
change for all the years up to and including 1999. 
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The formula in C8 needs to be a relative reference (cell references th at 
when copied to new locations). IUst 

3 Calculate the population density 

A In cell D7 create a formula, by referencing cell B3 to calculate the 
population per sq. km (population density). Before this formula i s C0Dj 
down make cell B3 an absolute reference. Cell references in formulae^ 
must be made absolute if they are not to be adjusted when copied. t 0 
create an absolute cell reference in Excel do the following: 

1 enter the formula in the usual way; 

2 position the cursor within the formula bar to the right of the cell 
reference that is to be made absolute (it will change from a cross 
to an T shape); 

3 press the F 4 key once and a $ sign will appear with the cell reference 
($B$3). 

B Copy the formula down the column. 

A formula containing an absolute cell reference can be copied anywhere 
in the spreadsheet and it will continue to refer back to the absolute cell 
(in this case B3). 


4 Construct a graph 

Construct a graph showing both the population and the percentage change in 
population. . 


5 Calculate ,he mean wntage change in population for the period. 

6 Estimate the future population 

for 200oZZZ 2 ° hange ' n POpUlatior) ,0 estimate the P°P ula,i 
lb) Pot these projections on to you, graph 

response «o« fea o,^ canbefou g nd Ph np 46i 


• 2 : 5 .. C . 0e .™f. ie . n . , .. 0 !.. variat| on (CofV) 

***••••••••* 

This is a widely used measure f 

of dispersion (the standard dev/t^^ ^ s P ers i°n. It relates an u/w/r 
mean around which the OkmJ- t0 the <*<"'* value of the 
standard deviation (s) 0 f ten S® takes P |a «. Clearly a data set 
dispersion than a data set T ^ mean «f ten units has a 
units but a still higher absolute" 1 ' ' *' *''^ er a Usolnte standard d 
We wml « say that: mcan W of 50 units. 
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SELF-CHECK 

QUESTIONS 


REVIEW 

QUESTIONS 


The coefficient of variation can be expressed (as here) as a decimal, or as a 
percentage. To obtain a percentage we simply multiply by 100, giving 100 per 
cent and 40 pei cent for this example. The data set with the highest coefficient 
of variation has the greatest relative dispersion. 


Proof of operational formula for variance 

The formula for the variance (p. 43) can be made more useful by simplifying, 
as follows: 


s 2 = 


IW - X) 2 

Z»5 


S 2 = 


s 2 = 


- 2X,X + X s ) 

Za 

Z^*? , f ZW . x’l'v 
Z'i TT + TT 


(expanding the bracket) 


s 2 = 


Irx 2 

Sr 


2X 2 + X 2 


I F,Xf 

It 


S 2 = ^ 


s 2 = 




2.7 Look back to the data in Self-check question 2.4 (p. 38). Find the variance 
and standard deviation for monthly production of passenger cars for export. 

2.8 Look back to the data in Self-check question 2.6 (p. 38). Find the variance 
and standard deviation for the percentage change in labour costs of senior 
managers throughout the multinational company. 

Note: Answers can be found on p. 328. 


2 1 A frozen food company has a contract with Farm A, to provide peas tor 
their processing plant. The daily supply of peas (in tonnes) over the past 
two weeks from Farm A has been as follows: 

Fantt A. 

357.38 262.80 319.95 412.90 398.46 330.33 341.27 

329.33 332.04 309.42 229.88 259.43 337.99 383.31 












r 
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1( % 


The frozen food company also contracts with Farm B The daily SUDn) 
peas (in tonnes) over the past two weeks from Farm B has been as^ 

364^*295 18 352.92 380.14 314.39 343.87 290.43 

29F21 306.03 348.88 333.81 291.99 324.79 315.25 

Calculate the respective values for each farm in terms of: 


(a) the mean; 

(b) the range; 

(c) the interquartile range; 

(d) the semi-interquartile range. 

Use your calculations to compare and contrast the relative contributions of 
each farm to the frozen food company. 


2.2 Use the data for Farm A and Farm B in question 2.1 above to calculate the 
mean deviations for both data sets. What do your results show? 


2.3 A company requires that chilled food cabinets in its supermarkets must 
maintain an average hourly temperature of 3.75°C ± 0.5°C. The manager at 
one of the supermarkets suspects that the performance of one of the shop's 
cabinets fails to meet this standard and therefore decides to monitor its 
performance hourly over a 30 day period with the following results: 


Temperature (°C) Frequency 


0-1 

1-2 

2- 3 

3- 4 

4- 5 

5- 6 

6- 7 


1 

11 

123 

322 

223 

39 

1 


grouped dataj^asses^whethe^h 6111 ^ 311116 

policy. t * le ec J u ipment conforms to the compare 


ine number of visitors in 

season were recorded as follows andS ’ ^ “ amusement P ark 




Visitors (OOO) Frequency 


6-8 

8-10 

10-12 

12-14 

14-16 

16-18 


4 

17 
61 
47 

18 
3 


Calculate the mean att 

a " d S,anda * ^‘viarion for ^(tf ng ^ S ~ -ason an 

at attenda nce. Comment on y, 
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2.5 A new production line had a mean daily rejection rate of 196 units with a 
standard deviation of 21.48 units in the first three months of operation. In 
the next three months the mean daily rejection rate was 94 units with a 
standard deviation of 12.62 units. Find the coefficient of variation in each 
case and use it to comment on the relative dispersion of the production 
line over the two time periods. 


2.6 A brewing company wishes to launch a new canned lager. It has close links 
with a major supermarket chain which will only permit a promotion of the 
lager in two of its stores. The selected stores monitor their sales of all 
brands of canned lager over a weekend period with the following results, 
which were communicated to the brewery: 


Sales value of 
lager (£) 

Frequency 

Store A Store B 

0-2.5 

27 

1 

2.5-5 

114 

3 

5-7.5 

333 

31 

7.5-10 

530 

142 

10-12.5 

504 

328 

12.5-15 

334 

498 

15-17.5 

121 

504 

17.5-20 

29 

351 

20-22.5 

5 

110 

22.5-25 

2 

29 

25-27.5 

1 

2,000 

3 

2,000 


What advice would you give the management of the brewery on the differ¬ 
ences and similarities in the pattern of expenditure in stores A and B. You 
may wish to consider: 

(a) Illustrating the data graphically - using the same axes for each store. 

(b) The mean consumer expenditure in each store. 

(c) The variability of customer expenditure. 

(d) Which supermarket provides the best opportunity for a successful 
promotion of the new lager. 

Answers to Review questions can be found on pp. 384-392. 


Further study and data 
Texts 

Bancroft G and O'Sullivan, G. (1993), QmMollve fo, no.»»>«>« m.l 

r " ^ ter ’ t $£3^—V <.«< 

CUr ^n, Intematicrnal l lrompson lltrslness Press, cl.ap.ers 3 and 1. 







chapter 3 


Regression and correlation 


Objectives 


Introduction 


By the end of this chapter you should be 

able to: 

► calculate the unique ‘line of best fit’ 
(least squares line) between a 
dependent variable (Y) and an 
independent variable (X). In other words 
you should be able to find the equation 
of that unique line Y = mX + c which 
minimises the sum of squared 
deviations from the line. This is 
sometimes called the linear regression 
line. 

► calculate the coefficient of 
determination ( R 2 ) and Pearson’s 
coefficient of correlation (R) between 
the two variables, thereby helping to 
establish the ‘goodness of fit’ of your 
least squares line. 

► calculate and interpret Spearman’s 
coefficient of rank correlation'. This 
seeks to establish whether or not there 
is a strong linear relationship between 
two sets of ranked data. 

► appreciate the approach to finding the 
regression line (linear or non-linear) 
which best fits more than two variables. 
In other words be broadly familiar with 
the techniques and problems of 
multiple regression analysis. 


Business activity is an aggregate of tens of thousands of 
decisions taken daily by small and large firms alike. Deci¬ 
sions bring about change and, hopefully, such change will 
benefit the firm and the wider economy. For example, 
‘If I spend more on advertising, sales volume will grow', 
'If 1 invest in the latest equipment, productivity will 
increase'. 

The cornerstone of many of these expectations is 
usually past experience by the firm in question, or the 
widely reported experiences of other firms. Of course we 
can be more confident that the expected outcome will 
in fact occur, if it is supported by the recorded experi¬ 
ences of a large number of decision takers. 

The statistical techniques of regression and correla¬ 
tion are important in helping determine whether such 
relationships can be established. Usually the firm is inter¬ 
ested in whether changing some independent variable 
(say advertising) will have the desired effect on some 
dependent variable (say sales volume). As we shall see, 
the section on regression will help us identify the equa¬ 
tion of the line or curve which 'best fits' the recorded 
data, often displayed in the form of a scatter diagram. 
The section on correlation will help us determine 
'how well' that line or curve actually fits that recorded 

data. 

Throughout this chapter we shall present two altern¬ 
ative approaches to calculation. The first will use the 
original data for the dependent and independent van- 
ables, Y and X respectively. The second will use redefined 
data for these variables, giving us the opportunity to use 

u • , vnHinv formulae'for our calculations. Although both approaches 

wm be'presented, you can concentrate on *atjpproach followed by your 
lecture course (or the approach you find the easiest.). 
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/c pl f check question', responses to each 'Pause 

.. “r-:: r - *—*■ «* -« 




3.1 Regression analysis 

#•••••••••• 

Regression analysis involves establishing a relationship between two 
Variables Here we initially illustrate in terms of a Unear relationship, alth^ 
more sophisticated approaches can establish similar non-.near ,eU ion% 
In ‘staple’ regression analysis we assume two variables only, y the depen*,, 
v the indeoendent variable. 




Simple linear regression 


We consider the equation of a straight line (linear equation) in more detail in 
Chapter 12 (p. 306). Here we note that the general equation of a straight line 


figure 3.1 

Finding the least 
squares line (LSL) 
Y = mX + c 


Y =mX + c 

where Y = dependent variable 
X = independent variable 
m = gradient of the line 

c = point where the line intersects the Y axis 


Figure 3.1 shows a scatter diagram with dots representing 
ordinates (. X,Y ) plotted on the diagram. Our aim is to find 
(7 = mX + c ) which best fits the data. We use the symbol 
estimated value of Y from the least squares line. 


the different co- 
an estimate ^ 
Y to refer to th; 

























WORKED 
EXAMPLE 3.1 
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One nn K- 1 - '-'UKKbLATION 53 

w^“°ha^ )l 'he sum Of 

"Oted in Chapter 2 for ,he mean del < With the same P™bie m 

± and then ignoring them (i.e taking the°m Si T ° f ldentif y ,n S these signs 
all these deviations so that all the Z s In h ) ' “ “ much easiCT '° *4* 
We shall therefore me the I . 8 be P osltive - 

(straight) line which minimises the sum"^,^“ “ ° Ur " ne of best fit - 'hat 
observations from the line. f the Squared devia tions of the actual 

n other woids the least squares line is that (unique) line for which- 

-<*?+.. rl 2 ic O _ 


d\ + d\ + 


■ dl is a minimum. 


In terms of Figure 3.1, the least squares line will be such that j^df is a minimum. 

/=1 

of the h fon OPe<m) and interce P' < c > of this unique line can be found using either 
. H’ff owln § two approaches. In each case the formula outlined is obtained 
ta differentiating expressions in order to find those values of m and c which 
will minimise the sum of squared deviations from the (straight) line containing 

t ose values. Proofs of these formulae can be found elsewhere (see 'Further 
study', p. 76). 

Here we present both types of approach in solving regression problems. 
Remember to use the approach followed by your course or the approach you 
find the easiest. 

Approach 1: Using original data (X, Y) 

If we use the data given in the question as they stand (original data), then the 
following formulae will help us calculate m and c for the least squares line. 
These formulae make use of the sigma notation discussed in Chapter 2, p. 28. 


Original data formulae 


-1 x,Zy, 


m = 


i =1 


/ = 1 /=1 


n^X? - 


/ = 1 


I"- 

l ' =1 


c - Y - mX 

where n = number of observations 
X, Y = Arithmetic means of X /( Y, 


Suppose we wish to calculate the least squares 
shown on the scatter diagram (Figure 3.2(a)). 


lino tor the four observations 
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figure 3.2 

Using original data 
(X ff V) to calculate 
least squares line 



Solution 

we can set the data oat in the columns needed to calculate m and c us,n g *e 
above formulae, as follows: 


X, 


Vi 


X,V 


Xf 


1 

2 

3 

4 

IX = 10 


0.5 

2 

4.5 

5 

IV = 12 


0.5 

4 

13.5 

20 

IXV = 38 


1 

4 

9 

16 

IX 1 2 3 4 = 30 


Note: For simplicity we omit subscripts, etc. in the formulae. 
n = 4 observations 




X = 


= — = 2.5 
4 


n 4 

m= nXxV-IxIV 

n 5 > 2 -( X *) 2 

m = 4(38} ~ ( 1°)(12) 
4(30)-(10) 2 

m = 1.6 

c = V - mX 


152 - 120 32 

120 - 100 ~ 20 


= 1.6 



























figure 3.3 

Ch anging the origin 
from zero to the 
Point of means (X,Y) 
to es tablish the 
codin g formulae. 

Here a single 

Nervation (9, 8) 
is Expressed as 
(4 -2) using the new 
!* es (x,y) when 
* = 5 and ?= 6. 


'• e - c = 3 _ 


1 -6 (2.5) 
c= 3 - 4 


egression and correlation 


55 


i.e. V: 


-1 
1.6X 


Here the leact o 

estimated value of TitT haS 3 gradie nt (m) of +i 6 
<0 on the vertical 1 ^ b> 16 d nits for ev e T ° ,h6r WOrds the 

is -1- This least son, * T ° ther wor(J e when X T* * The intercept 
t Sdua tes line is plotted in Figure s 2( b) eStimated V3 ' de ° f * 

I?) a S T„ e F r g r 3 f 3 u " y n r efin h e ,he study ’ 

coding formufae. We C,hen cTTaf*" 8 ““ « « S* ZZ 

since the least squares line goes through thT th ‘ S ° ti8in (X ’ y > more easily 
longer have any problem in cTcZtT?° im ° f means <*•» » »ha we ™ 
course we can then go back to the 'trT zernT •*' “‘““P* at this ori 8 ln ' Of 
found m at the new origin (X, 7) ,gln and find c once we have 

!f we use the redefined data for xtX ~ 7\ a 
formulae will help us calculate m and i 

Coding formulae 

Here we change the origin from zero to the nnint 

new axes and data, x, (X, - X) and y (y, ^ meanS ^ and express 


y,‘ 

8 

y,(v 

\-Y) 

^ Y= mX+ c 


2 


) 

Y{ 6) 








-* x,(X,-X) 

c 










0 

X(5) < 

3 K 



New axes at this origin | 




— 

. . . .. 
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WORKED 
EXAMPLE 3.2 


X x ' y - 


/=! 



/=! 


c = Y — niX 


u . ra i ru iated m (the slope of the least squares line) at the new origin (*,?, 

V vprt to the zero origin to calculate c (the intercept). The slope, m, lso[ 
we revert to ^ origins for the linear least squares line. 

COU Let'u e s n follow through our previous worked example, but this time using 
reeled data. The two approaches should give us the same result. Le, us checi 

that this is the case! 


Calculate the least squares line for the same four observations shown on the 
scatter diagram in Figure 3.2a (p. 54). 


Solution 

Here we use the new origin (X,Y), namely (2.5,3) and redefine the data using the 
axes x, (X, - 2.5) and y, (V, - 3). 


figure 3.4 

Using redefined data 
(x,y,) to calculate 

% 
6 - 

the least squares 

line 

5 - 


4 - 


7-3 -- 


^ X 

(a) Cha "ging the origin from 0 
»-e. to (2.5,3) 



A 


to (X, Y), (b) Least squares line 


coorng 
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X, 


y, 


1 

2 

3 

4 

IX = 10 


0.5 

2 

4.5 

5 

IY= 12 


X 

y, 



(Xi - X) 

(Y, - Y) 

xy, 

x? 

-1.5 

-2.5 

3.75 

2.25 

-0.5 

-1 

0.50 

0.25 

0.5 

1.5 

0.75 

0.25 

1.5 

2 

3.00 

2.25 



Ixy = 8.00 

lx 2 = 5.00 


*= 2 * 

4 



12 

4 


= 3 


_X_ 

’ 1 * 


8.00 

m = - 

5.00 


1.6 


m = 1.6 


We have now calculated m for the least squares line at the new origin, which is of 
course the same as m at the ‘true’ zero origin. We now revert to the zero origin to 
find c, the vertical intercept. 


c = Y - mX 


i.e. c = 3 - 1.6(2.5) 
c = 3 - 4 

c = -1 

i p 9 = 1.6X - 1 

We have the same result for the least squares line as in Approach 1, using the 
same data in each case. Clearly these two approaches are alternatives, yielding 

the same results. 


DID YOU KNOW? 

The founder of the ‘least squares' regression technique 
was a French mathematician, Adrien-Marie Legendre. 
Born in 1752, he was awarded degrees in mathematics 
a nd physics. Having lost his wealth in the French 
Revolution in 1789 he was forced to ‘work' for a living, 
lecturing in maths in Paris, working at the Greenwich 
Observatory in London and writing books. He devtloped 
the least squares technique in 1805, which Wits furth 
developed by UK statistician Rober Gauss. 


Prediction 

Of course, once we have obtained our 
estimated least squares line we can use it 
for prediction (forecasting). We can 
insert values of the independent variable 
(A',) not yet experienced and predict values 
of the dependent variable () ,). Of course 
the confidence we can have in any such 
prediction will depend upon: 
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► the past relationship between Y and X continuing into the fu tlIre 

► the estimated least squares line (F) fitting the past data rather well 

We return to the 'goodness of fit' of the least squares line in the next seer 
our previous example we could substitute the value X, = 10 into 0Ur eqi ^ ln 

Y = 1.6X - 1 
Y= 1.6(10) - 1 

Y = 16 - 1 = 15 

Hence our predicted value for Y is 15, should X be given the value of io ^ 
consider forecasting in more detail in Chapter 4. 


PAUSE FOR THOUGHT 3.1 


_ Under what circumstances would you be reasonably confident in using you 

least squares line for prediction? Can you think of any problems that might arise 
to make these predictions unreliable? 


SELF-CHECK 3.1 (a) Use the data below to create a scatter diagram. 

QUESTIONS X 1 2 3 4 5 6 7 8 

Y 2 5 6 7 9 12 15 16 

(b) Find the regression line relating Y (as dependent variable) to X(as 
independent variable). 

(c) Comment on your results. 

3.2 (a) Use the data below to create a scatter diagram. 

X12 34 5678 9 10 

Y 19 18 16 16 20 13 6 6 11 9 

(b) Find the regression line relating Y (as dependent variable) to X (a s 
independent variable). 

(c) Comment on your results. 

3.3 Find the ‘line of best fit’ (least squares line) for the following data. 

A Q £9 

x 4 5 6 12 13 13 16 16 17 * 7 334 

Y 47 111 124 240 211 205 276 305 309 302 

X 20 22 24 

Y 302 371 241 

Note: Answers can be found on pp. 328-331. 


3.2 Correlation 

the actu^ 

The idea here is to measure how well the regression line fits 
Two key measures are frequently used in this respect. 

► the coefficient of determination (R 2 ) 

► Pearson's coefficient of correlation ( R ) 
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figure 3.5 

Deviation for each 
observation (</,) is 
the actual value (F,) 
minus the arithmetic 
mean (F). This can 
be split into two 
parts: explained 
deviation ( d e ) and 
unexplained 
deviation (d u ) 



Before considering foimulae for calculating these measures, it will be helpful to 
define the concepts of deviation and variation which underpin them. Figure 3.5 
will be used for illustration. 


Deviation (d,) is the difference between the actual value of an observation 
(F,) and its arithmetic mean (F). This deviation can be split into two separate 
parts. An explained part ( d e ) which is predicted or accounted for by the 
regression line. An unexplained part ( d u ) which is not predicted or accounted 
for by the regression line. 

Deviation = d t = F f - Y 

Deviation = explained deviation + unexplained deviation 
Deviation = d t = d e + d u 

Summing such deviations across all n observations gives total deviation. 


d. = £d, + £d„ 

7=1 1=1 


7 = 1 

total 


_ explained + unexplained 
deviation " deviation deviation 

• (J 2 } is the square of the difference between the actual value of 

lT1 u e nn Y and its arithmetic mean (F). As before, squaring the 

thTproWen, of firs, identifying, then dropping ,he sign ± C 

:h deviation. 

^ing the squared deviations across all * observations gives total vanat,ot, 
it ran be shown that: 


a " Ji, l7 

Id? = S''? + S d ? 

fit 

i.e. total = explained 
variation variation 


unexplained 

variation 
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figure 3.6 

Coefficient of 
determination (R 2 ) 


methods for business and 


economics 



PAUSE FOR THOUGHT 3.2 


How does total variation differ from the variance discussed in Chapter 2? 


Coefficient of determination (R 2 ) 

r2 _ explained variation 
total variation 

The coefficient of determination (R 2 ) is the ratio of 'explained variatior 
andtliprpf tl0n Clearly when = 1 as in Figure 3.6(a), then all the devial 
line. We have Iperfect fit" 56 CXplained or accounted for by the regress 

and therefore^riation^an bT* ^ ^ FigUre 3<6(b) ' none ° f the ^ 

line. We effectively havp a ex P la ined or accounted for by the regress 
line fits as well as any other^ 11 ° m SCatter Points to which any regress 
The closer R 2 is +q i ji 1 

data. etter the fit of the least squares line to the ad 

Pearson’s coefficient nf 

f of delation (R) 

R = , ( ^Pl^l!lyariatio n 

tota *variafion 

Pearsoncocffi t j . 

ol ^termination u*?* ° f Cwrre lution (R) ic t , i -npffiek 

Just as i/-‘ , J K) ls the square root of the toer 

is and" 1 ! 11 ^'' Valuo f "r l /i‘ofVi' ? R Varies between ±1 and 0. ' Pclt | 

When «« Mali; „ > r hen **>« relationship between t »«' 
1!, Utp between Kami x is indirect (in vcisl1 
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figure 3. 7 

Coefficient of 
correlation (/?) 



Figure 3.7 outlines the various possible values for R, the coefficient of cor¬ 
relation. This is often called Pearson's coefficient of correlation and can have 
values between +1 and -1. 

(a) R = +1 

This is perfect positive correlation. Here the relationship between Y and X 
is direct (positive), as both variables rise or fall together. The sign for R is 
therefore positive, and the value is +1 since the fit of the least squares 
line is perfect (all the variation is explained). 

(b) R > 0 but < 1 

Again a direct (positive) relationship between Y and X, so the sign is 
positive for R. Value for R is < 1, since not all of the variation is 
explained. 

(c) R = 0 (random scatter) 

Essentially no relationship between Y and X. All the variation is 
unexplained. 

(d) R < 0 but > -1 

Here the relationship between Y and X is indirect (negative or inverse) as 
each variable moves in the opposite direction; when one variable rises the 
other falls, and vice verse. The sign for R is therefore negative but the 
value is greater than -1 since not all of the variation is explained. 


(e) R — —1 

This is perfect negative correlation. The relationship between Y and A is 
indirect (negative or inverse) and the value is -1 since the tit of the least 
squares line is perfect (all the variation is explained). 
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Calculating R 2 and R 



DID YOU KNOW? articular 

The easiest way to interpret the meaning o 

value of R. the Pearson coefficient of correlation tsj 

square it. For example a value R = +0.8 means that 

If = (+0 8f = 0.64. i.e. 64 per cent of total variation 

can be • explained' or accounted for by your regression 

line. 


As when calculating the least 
two alternative approaches canh^'S 
derive formulae for calculating r? 6 ^ 
consistent, and follow the same 
here as you have done for calcuj 0 ^ 
regression line. dUn 8 th e 


Approach 1: Using original data (X,Y) 

Finding R 2 and R using original data 


ntx,Yi - 
1=1 1=1 1=1 


n ( n ^ 

2" 


n 

^ n ^ 

2 

, "X*?- x*, 


X 

M 

-c 

-• ro 

1 



«[ 1=1 V- 1 J 



/=1 

{1=1 j 



where n = number of observations 

R = Vr 2 


WORKED Here we apply our formula to the data and calculations already presented on 

EXAMPLE 3.3 p. 54. We reproduce this below, noting that we need an extra column for Y r 


X,Y, Xf Y 2 


0.5 l 0.25 

4 4 4 

13.5 9 20.25 

20 16 25 

1XY= 38 XX 2 = 30 XV 2 = 49.50 


1 0.5 

2 2 

3 4.5 

4 5 

£*=10 IY=12 


R 2 = 


R 2 = 


R 2 = 


4(38) - 2) 

I 4,30 ' '-0: P l-MD.r,; 


152-120 

J|l20 - 1001 x [19 8 - 144 ] 

_32__ 

I.V 20 x 54 




l 

32 

~7 r 

2 

32 ' 


VI080 


[32.86 
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WORKED 
EXAMPLE 3.4 


In other words 0.948 or, as a percentage, 94.8 per cent of the total variation is 
explained’ or accounted for by our regression line. This indicates a very good fit 
of our least squares line to the data, giving us greater confidence should we use 
our regression line for prediction (forecasting). 


Approach 2: Using redefined data 

Alternatively, we can transfer the origin from zero to the point of means (X,F) 
and use a coding formula for R z and R. In this case we make use of new axes 
x,- = X,. - X, y t = Y, - Y. 

Finding R 2 and R using the coding formula 

Here we can make use of calculations already undertaken in finding m and c 
for the regression line Y = mX + c when using the coding formula 

ixj 


2 >? 

i=i 



where x, = X, - X 
Y, = Y,-Y 

r= -Jr 2 


Here we apply our coding formula to the data and calculations already presented 
on p. 57. We reproduce this below, noting that this time we need an extra column 
for yf. 


Solution 




X 

y, 




X 

Y, 

(X, - X) 

(Y, - Y) 

xy, 

x? 

y? 

1 

0.5 

-1.5 

-2.5 

3.75 

2.25 

6.25 

2 

2 

-0.5 

-1 

0.50 

0.25 

1 

3 

4.5 

0.5 

1.5 

0.75 

0.25 

2.25 

4 

5 

1.5 

2 

3.00 

2.25 

4 

IX = 10 

I Y= 12 



Ixy = 8.00 

lx 2 = 5.00 

I>- = 13.50 


x = 


l£.“-2.6 

4 4 


Y = 


-ST 


12 

4 


3 


Using our coding formula 
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WORKED 
EXAMPLE 3.5 


methods for 


business and economics 



tz, nrwi 3.50) 


64.00 

—-= 0.948 

<2-7 


P 2 _ q.948 

Again, note that either approach will give the same solution. 


The following worked example uses both approaches to calculate 
line and the coefficients of determination and correlation for a da 


the 
a data s 


re gressi 


> 10(1 


y . 


Approach 1 solution: Using original data 


X, 

Y; 

XX 

x? 

Y> 

0 

-4 

0 

0 

16 

1 

-3 

-3 

1 

9 

2 

-1 

-2 

4 

1 

3 

2 

6 

9 

4 

4 

3 

12 

16 

9 

5 

6 

30 

25 

36 

IX = 15 

M 

-< 

II 

GO 

IXY = 43 

IX 2 = 55 

1Y 2 = 75 


m _ - Jjxjy 

m _ 6(43) - (15)(3) _ 258 - 45 213 

6(55) -(15) 2 ~ 330 - 225 = 105 

m = +2.03 


£= 2.03X- 4K 


c = Y- mX 

7=0.5, X = 2.5 
c = 0.5 - 2.03(2.5) 

c = -4.6 


R 2 = 


n X* 2 - 


(I*) 2 ] X jn£y* - 


R' = 



R y ,, 


_258 -^45 

n/I 105 | x [44lj 
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R 2 = 


213 


-|2 


V r 46^305 


R 2 = 


213 

215.2 


n2 


R 2 = 0.98 


R = a/o.98 
R = 0.99 


PAUSE FOR THOUGHT 3.3 


P/ot tAie scatter diagram from the data 
impose the least squares line on your scatter 
to find a high R 2 ? 


for this Worked example. Now 
diagram. Would you have expected 


Approach 2 solution: Using redefined data 




X, 

y, 




X, 

Y 

; fX - x; 

(Y, - Y) x,y, 

x 2 

yf 

0 

-4 

-2.5 

-4.5 

11.25 

6.25 

20.25 

1 

-3 

-1.5 

-3.5 

5.25 

2.25 

12.25 

2 

-1 

-0.5 

-1.5 

0.75 

0.25 

2.25 

3 

2 

0.5 

1.5 

0.75 

0.25 

2.25 

4 

3 

1.5 

2.5 

3.75 

2.25 

6.25 

5 

6 

2.5 

5.5 

13.75 

6.25 

30.25 

IX = 15 

XY= 3 



Xxy = 35.5 

Lx 2 = 17.50 

ly 2 = 73.50 

X= 2.5 

Y= 0.5 





2 > 

m = v 2 

2 > 

1 

l>- 

11 

CJ 

mX 

P 2 = 

(W 

(miyf) 



35.5 
m = - 

17.5 

c= 0.5 

- (2.03)2.5 

ff 2 - 

1.260.25 

1.286.25 




R 2 = 0.98 


m = 2.03 c = -4.6 

R = Vo.98 

Y = 2.0 3X - 4J3 P_5l0-_99 


activity 3 


The intention here is to derive a regression (least squares) line relating the 
population of Greece (Y) to time (X). This is important to the long-term planning 
of our company which sees Greece as a major export market. 
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lMC cc and ECONOMICS 

quantitative methods for BUS 

. Activity 2 (Cn. 


JDS FOR BU3M— 

. of Activity 2 (Chapter 2, pp. 45-46) and a sp,^ 
YOU can use the *« 0 * proa ch used involves the coding formula 
With which you are flrs , principles or use the in-built funa^ * 

p. 55). You can solve eroi 




Solving from first principles 


1 Set up your workshee ^ ^ Gree F population - from Activity 2 

Either load your origina ^ ^ ^ data from your original sheet to 

(Chapter 2, pp. 45 ) jnt0 a new spreadsheet. You should 

clean sheet or re-en e A3>1 but without the formulae. 

.ctim 9 f to Figur 


a new 
end up 



n 



n 


/=i 


(a) 

(b) 

(c) 

(d) 

(e) 


!n cell F 6 r i 9te *' ~ whe ^ 

i" os 

" 0611 H « square k‘. " 


Y values. 
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(f) Highlight cells E6 to H6 and copy the formulae down to Row 19. 

(g) In cell 620 & H20 find the sums of x,y f and xf. 

(h) In cell F3 calculate m by dividing the sum of x,y ( by the sum of xf. 

3 Find the value of c (intercept) where: c = Y- mX. 

Your table already contains all the values needed to substitute in the above 
equation, therefore, enter the formula in Figure A3.1 cell G3 to calculate the 
intercept. 

4 Calculate the coefficient of determination R 2 3 4 5 6 . 

Use the equation: 


5/,-y, 

(=i 


f n 

\ 

n \ 

x*? 


Xy? 

tH 

II 

) 

v- 1 ) 


Here we can make use of the calculations already undertaken in finding m 
and c for the regression line. 

(a) Extend your table by adding the heading yi A 2 in cell 15. 

(b) In cell 16 calculate yf. 

(c) Copy the formula down to cell 119. 

(d) In cell 120 find the sum of yf. 

(e) In cell 13 enter the formula for R 2 . 

5 Forecast the values of X using the equation: Y = c + mX. 

(a) Add the heading yhat in cell B5 and the years to be predicted in cells 
C20 to C22. 

(b) In cell B 6 calculate Y for X = 1986. 

(c) Copy the formulae down to cell B22. 

Cells B 6 to B22 hold the predicted values from the equation: 

Y = -90037801.5 + 50277.5(X) 

B20 to B22 hold the predictions for the years 2000-2002. 

Figure A3.3 (on p. 465) shows the complete solution. 


Using in-built functions 

1 Select a new unused worksheet and enter the following labels: 


B CD 


2 Slope 

3 

4 

5 V 

6 Population 


Copy the dependent data (Y) - the Greek population data into cells B7 to B20. 
Enter the independent data (X) 1986-2002 Into cells C7 to C23. 

You should end up with something similar to Figure A3.2 below. 


Intercept R A 2 

X 

Year Y hat 














uaicuia re m, c ana 

In cells B3:D3 use the in-built functions to calculate the slope or gradient, 
intercept and R 2 . In Microsoft's Excel, for example, the following functions and 
their arguments are used: 

= Slope (known Ys,known Xs) 

= Intercept (known Ys,known Xs) 

= RSQ (known Ys,known Xs) 

Calculate Y where Y = c + mX 

s 2oo1t 20021 int ° ce,is b21,o82! 

(c ) C °W the formulae down to cell mV" ° "*° ** eQUat '° n 

Cells D7 to D23 hold the predicted values from the equation; 

-90037801.5 + 50277.5(X) 

Figure A3.4 (p l h0ld the Predicted values for the years 2000-2°°- 

) Sh0ws ,he 'foplete solution. 


„ ,., k ^ 

sometimes data are DrP e . 

Ascending (last t 0 first).™' mav h'" k m,er ' whet her descending Oi' 5 ' ,c 

y be ,h » the ranking comes from two 
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WORKED 
EXAMPLE 3.6 


DsvrhnlnJ ^ en a PPl icants for a post are ranked both by interview and by 
rankings agree! 6 ^' ^ m ^^ lt t ^ len wis h t0 check to what extent the different 

. J" , SUCl \ sit ) Uations Spearman's coefficient of rank correlation (R s ) 

might be calculated. The formula for R s is as follows: 


6 


R* = 1 - 


i-1 


n 3 - n 


where R s = Spearman’s rank correlation coefficient 

d, = difference in ranking of a given observation 
n = number of observations 


If = 1, then clearly we have perfect agreement between the respective rankings. 
This follows from the fact that each d t would be zero when the respective 
rankings are identical, giving R s = 1 - 0 in the above expression. The closer to 1 
the value of R SI the better the agreement between the respective rankings. 


Ten candidates for a managerial post were ranked by interview and psychological 
test in the following manner: 



4 

B 

C 

Candidates rankings 
D E F G 

H 

/ 

J 

1 Interview 

4 

2 

7 

1 

5 

6 

9 

3 

10 

8 

II Psychological 

3 

2 

5 

1 

4 

9 

6 

7 

8 

10 

test 












Calculate Spearman’s rank correlation coefficient and discuss whether it 
represents a measure of agreement between the two types of test. 


Solution 

The first step is to take the difference in the ranks (of,); the second step is to 
square each of these differences; the third step is to sum the squares of the 
differences. We then substitute the result into the formula 


<j. 10201334 22 

d 2 10 40l99 16 44 


S<tf=4 8 

/ = 1 

6 Y dj „ 6 X 48 , 288 

Ft s = 1 - 10 3 - 10 990 


0.709 


The Spearman’s rank correlation coefficient shows some agreement in ranking 
between the two types of procedure. 
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We can use Spearman’s coefficient of rank correlation ( R s ) whenevp 
for two variables are given in rank order (i.e. the data are ordinal). Wh ® r d 
are given in terms of absolute values (i.e. the data are cardinal) We w 
use the earlier coefficient of correlation (/?). However we can convert twn % 
of data which are given in absolute values into rank order should We wi ° Sets 
calculate R s . Further, should one set of data be given in terms of absoj t0 
(cardinal) and the other in terms of rank order (ordinal) then only by Con 6 Vallle $ 
the cardinal data to ordinal and calculating R $ might we be able to estabi^*^ 
measure of ‘goodness of fit' between the two types of data. ^ 3 


SELF-CHECK 

QUESTIONS 


3.4 Using the data for Self-check question 3.1 (p. 58) 

(a) Find the coefficient of determination. 

(b) Find Pearson’s coefficient of correlation. 

(c) Comment on your results. 

3.5 Using the data for Self-check question 3.2 (p. 58) 

(a) Find the coefficient of determination. 

(b) Find Pearson’s coefficient of correlation. 

(c) Comment on your results. 

3.6 The data below give the actual sales of a company (£000s) in each of eight 

regions of a country together with the forecast of sales by two different 
methods. 


Region Actual sales (£000) Forecast: Method 1 


Forecast: Method 2 


A 

B 

C 

D 

E 

F 

G 

H 


15 

19 

30 

12 

58 

10 

23 

17 


13 

25 
23 

26 
48 
15 
28 
10 


(,) r e,a,ion . 

, 7 1 ales and Precast 1 

(2) Actual sales and forecast 2 


3. 4 Mu | t|p | e re g ress | 0 n 

Our analysis has, until th- *** . 

two vanables: a dependent variahi C °T lcentrated on relationships in 

e and an independent variable 


16 

19 

26 

14 

65 

19 

27 

22 


— muu ivrecast 2 

ecasting method would you recommend next year? 
*<>*: Answers can he ,ou„d on pp. 333-334. 
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regression AND CORRELA 
variables, and attempts to fit lines 


is termed multiple re S 

involved, with more 


many relationships involve more than two 
(or surfaces) of 'best fit' to such variables 
analysis. Here we merely touch upon a few of the issues 

detailed sources of information indicated on p. 76. nlication 

Win Hornby et al. (2001) provide a useful illustration of t e app 
of multiple regression analysis in their review of a study seeking to re 
demand for beer in the UK to as many as seven separate independent varia 
namely: 


1 P b the real price of beer (i.e. the price of beer in relation to the rate of 
inflation). 

2 A b the real level of advertising on beer (i.e. the level of advertising in 
relation to an advertising index of TV and press). 

3 A a the real level of advertising on other alcoholic drinks. 

4 W the daily mean temperature. 

5 F r the real level of personal disposable income. 

6 P c the real price of other alcoholic drink. 

7 S a seasonal factor to reflect the higher consumption of beer in the 
summer months and at Christmas. 


Clearly we no longer have a two-dimensional scatter diagram. Instead of a 
line of 'best fit' we now have a plane or surface which 'best fits' a multi¬ 
dimensional scatter diagram. Figure 11.9 (p. 272) gives an idea of a 'surface' in 
a three-dimensional situation. Statistical packages are frequently used to calcu¬ 
late such relationships. 

Using quarterly data stretching back over a 15 year period the authors of 
this study, W. D. Reekie and C. Blight, established the following results from 
60 data observations: 


Q = 3.18 - 1.355P b + 0.0044 b + 0.0194 a + 0.057 W+ 0.085V, + 0.371P C + 0.0195S 
(0.287) (0.64) (0.32) (2.66)* (5.84)* (0.10) (0.173) 

where Q = demand for beer in the UK (millions of barrels per year) 

R 2 = 0.57 

t- ratios are in brackets. * indicates the statistically significant variables. 


► Notice that the signs in front of six of the independent variables are 
positive, suggesting that a rise in those variables will increase the demand 
for beer in the UK (shift the demand curve to the right). 

► However the sign in front of P b , the real price of beer, is negative, 
suggesting that a rise in the price of beer will result in a fall in the 
demand for beer (i.e. a contraction in demand, moving up the demand 
curve from right to left). 

► The numbers (coefficients) in front of each independent variable indicate 
the magnitude of the respective relationships. For example, any given 
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percentage change in the price of beer will have the larges, ft** 
re“topact g on the demand for beer as compared to changes h 

► CeCgfCrc e e n pt™e'rm le of 3.18 indicates that should these seven 
independent variables have zero values, there would shll be a si 2eabif 
demand for beer in the UK (3.18 million barrels per year). 

► The R 2 figures of 0.57 indicates that this specified relationship only 
explains (accounts for) some 57 per cent of the total variation ( see 
p. 60 above). This means that some 43 per cent of the total variation 
is unexplained. For example we may have omitted one or more 
independent variables which really do have an impact on the demand 
for beer in the UK. 

► The figures in brackets refer to the t statistic scores (or t ratios) discussed 
in Chapters 6 and 7. The null hypothesis (see p. 159) for the t test 
conducted on each independent variable is that the coefficient in front 
of that independent variable is zero. The larger the t score, the more 
confident we can be in rejecting the null hypothesis, i.e. accepting the 
alternative hypothesis, namely that the independent variable in question 
really does have a non-zero coefficient. Put another way, the larger the t 
score, the more confident we can be in concluding that the particular 
independent variable is 'significant' in influencing the dependent 
variable. In the results presented above, we can be most confident about 
the influence on the demand for beer of expenditure on advertising on 
other alcoholic drinks and the daily mean temperature. 

Note: Perhaps surprisingly the sign of the coefficient in front of is positive; 

suggesting that as more is spent on advertising other alcoholic drinks, the demand 

for beer increases. This suggests positive 'spillover’ effects, namely that alcoholic 

alcoholic re drinks n also benefit' an<1 “ ° ne * ype ‘ S advertised 50 other ,ypeS * 



139 152 128 183 

Comment on the reliability of v„ . 

y your prediction. 

’ The weather was 

certain product. CS *" im P°rtant element in the sales pat* 

and ,lle average h"? 1 period *he average daily air teffll 
emperature. The resufts ‘" y Sales of the product were calc" 1 ' 
esu lh were as follows: 
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Temp fC) 

10 

11 

12 

13 14 

IS 16 17 

18 

19 

20 

21 

22 

23 

24 

Average 

103 

104 

95 

83 81 

75 68 47 

53 

40 

43 

40 

38 

39 

35 

daily sales 














(a) Plot a scatter diagram of the data. What does this suggest? 

(b) Calculate the linear regression equation that best fits the data. 

(c) Plot the least squares regression line on your scatter diagram. 

(d) Use the regression equation to estimate the number of products that 
will be sold if the temperature reaches 25°C. 


3.3 The data below show the lateness of trains (in minutes) arriving at a ter¬ 
minus station and the number of passengers alighting from those trains. 
The train operator believes that the number of passengers (T) will be greater 
if the train arrives a little later (X) than scheduled. 


Minutes late 

No. of passengers 

1 

1,459 

3 

395 

0 

534 

2 

641 

7 

927 

3 

650 

4 

447 

5 

392 

0 

569 

11 

713 

1 

401 

12 

2,691 

1 

443 

7 

883 

12 

1,577 

11 

147 

7 

568 

2 

455 

3 

632 

4 

531 


(a) 

(b) 

(c) 

(d) 


Plot a scatter diagram of the data. 

Calculate the linear regression equation that best fits the data. 

Plot the least squares regression line on your scatter diagram, 
is their any evidence to suggest that there is a relationship between 
the number of passengers and the punctuality of the tram. 


UA manager needs to determine the relationship between the company s 
advertising expenditure and its sales revenue. If the relationship s a strong 
d thenlhe company will be able to forecast the sales revenue tor a g.ven 
advertising expenditure. The following data have been collected over the 

past eight years. 
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Advertising 

(£OOOs) 


Sales 

(£OOOs) 


Q 74 4 U.^oo 

8.78 39.423 

9.25 47.389 


40.285 


45.707 


10.29 47.711 

11.22 52.500 

12.92 53.727 

13.99 64.389 


(a) Plot a scatter diagram of the data. 

(b) Calculate the linear regression equation that best fits the data. 

(c) Estimate the sales revenue when advertising is: 

£11,000 

£13,500 

£15,000 

(d) Is the relationship sufficiently strong to use as a predictive method? 

3.5 The data below show a hypothetical index of economic activity and the 
sales revenue for a firm over a 14 year period. 


Index of Sales 

economic activity revenue (£000) 


103 

104 

104 

105 
105 

107 

108 
108 
109 

109 

110 
111 
112 
112 


1,188 

1,224 

1,243 

1,239 

1,242 

1,138 

1,481 

1,562 

1,554 

1,566 

1,598 

1,562 

1,609 

1,499 



(b) How well does 


gression fine relating the two sets of data. Ltf s ‘* l 
s > endent var * a fil e - Comment on your result' 

Ur I0 8 ress 'on line fit the data. 


3.6 a - 
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Work period 
(minutes) 

Refects 

% 

60 

2.3 

65 

2.5 

70 

2.7 

75 

2.9 

80 

3.2 

85 

3.6 

90 

3.5 

95 

3.7 

100 

4.0 

105 

4.4 

110 

4.3 

115 

4.6 

120 

4.7 

125 

3.3 

130 

5.1 

135 

5.2 


(a) Calculate the regression line and use it to consider if there is any 
evidence to support the quality control manager's theory. 

(b) Given the above data, is there a case to suggest that work periods not 
exceeding two hours would reduce the number of rejects below 5 per cent? 

(c) How well does your regression line fit the data? 

3.7 The marketing department believes that consumer tastes for beer are related 
to geographical area. They have therefore commissioned research to inves¬ 
tigate the popularity of six leading brands of beer. Two different panels, one 
from the north the other from the south, tasted the products and ranked 
them in their order of preference. To avoid bias, the name of the products 
was withheld and substituted with the labels A-F. The results of the blind 
tasting were as follows: 


Northern Southern 
Product panel panel 


A 

B 

C 

D 

E 

F 


3 

4 

5 
2 

6 
1 


2 

4 
3 
6 

5 
1 


Is there any statistical evidence to suggest that the tastes of consumers 
are related to geographical region? 


,o A motoring organisation commissioned two independent testing laborator- 
Lto assess the safety of eight of the best-selling models ot car in the mid- 
rlilv saloon market. The laboratories used different testing procedures 
rd 8e me«hods "an k ,ng the models In order o, safety (where 1 - safest, 

8 = least safe) as follows. 










? 


76 QUANTITATIVE 


methods for 


business and economics 


Product 


Lab 


l Lab 2 


Chimera 205 
Apollo T7 
Pastiche 
Ganymede 
Callisto LX 
QM4 

Alfredo GTi 
Arctura 



1 


Is there any correlation between the two sets of results? 
Note: Answers can be found on pp. 392-402. 


Further study and data 

Texts 

Bancroft, G. and O'Sullivan, G. (1993), Quantitative methods for accounting mi 
business studies, 3rd edn, McGraw Hill, chapter 9. 

Curwin, J. and Slater, R. (1996), Quantitative methods for business decisions, 4th 
edn, International Thompson Business Press, chapters 17, 18 and 19. 

Hornby, W., Gammie, B. and Wall, S. (2001), Business Economics, 2nd edn, 
Financial Times Prentice Hall, chapter 6. 

Lawson, M., Hubbard, S. and Pugh, P. (1995), Maths and statistics for business, 
Addison Wesley Longman. 

Morris, C. (1999), Quantitative approaches in business studies, 5th edn, Pitman, 
chapters 13, 14 and 15. 

SwiT^r^onn 9 ^?' Essential elements of business statistics, BPP, chapter 8. 
Palgrave Publhhe”rs! pm^ 3 ^ ^ bUSimSS ' mam gement and ^ nam ' 
chapter 3 ^ Quantitative methods for business shidies, Prentice Hall 

Wat f rS ' D ‘ < 19 97), QuantitatU 


Longman, chapter^^^^ metbods f or business, 2nd edn, Addison 
business, Pitman, chapter i^" 6 ^ Foundation quantitative methods M 

Jr 

aUnd “ f Chapter 1(p . 2s) 

° n ;! 1 ." 6 ..zeroises 

Check the 

'‘" d to find extra questions, 











CHAPTER 4 


Time series and forecasting 


Objectives 


Introduction 


By the end of this chapter you should be 

able to: 

► use time series data to find both trend 
and seasonal variation as a basis for 
forecasting; 

► apply the technique of moving averages 
to help in the calculation of trend lines 
and seasonal factors; 

► combine time series and regression 
analysis for purposes of forecasting; 

► apply exponential smoothing as a 
forecasting technique; 

► be aware of the use of other 
approaches to forecasting in 
contemporary business practice. 


Data can be collected for purposes of analysis in a number 
of ways. For example data for household expenditure 
on food can be collected in different locations (nation, 
region, city, etc.) at a single point of time. These are called 
cross-sectional data. Alternatively data for household 
expenditure on food can be collected for a given loca¬ 
tion (nation, region, city, etc.) at different points in time. 
This is called time series data. 

In this chapter we consider the use of such time 
series data as a basis for estimating both trend and 
seasonal variation. As we shall see, such estimates can 
then be used as a basis for forecasting. 

In fact we have already touched on forecasting in 
Chapter 3 when we used the least squares line to predict 
values of the dependent variable (T) for as yet unobtained 


values of the independent variable (X). We take this 
asDect of forecasting using regression analysis a little further in this chapter. 
Indeed we see how we can usefully combine regression analysis with the results 

of our time series analysis to forecast future outcomes. 

The chapter concludes with a more general review of forecasting techniques, 
including exponential smoothing and other widely used techniques in current 


business praett* questions', responses to each 'Pause for thought' 

= t°o the Retifw quL.ons' can be found a, fhe end of the book. 


4.1 Time series analysis 

f Hata have been recorded through time, such as monthly, quarterly 
T^nZTjffor ouvut, safes revenue, profit employment unernpfoynrenr, 

prices, etc. 
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components Of time se" ^ eJ[hlbit one or more of the lollop 
Typically such time 

components. 


nponents. 

m whereby the variable appears to broadly 

A trend component mj 1 ^ J gh time . 
or fall (or remarn un * 8 ' b the »»■' i" 


rise 


A trenu through time. 

or fall (or remain un£ 8 > whereby the variab i e moves in regular 

A seasonal comp ^ nend line . Fo r example within a broad,, 

cycles mttmn ye ame (trend) , an ice cream manufacturer 

Sl"r^e peaksV sales in summer months and troughs * 
«ales in winter months (seasonal). 


sales in winter w-- ' 

S.T. " >7, h*h «». on w -* 

.___t-u nr nuarters or seasons within < 


▼ 4T1 4*Vl 1 


PAUSE FOR THOUGHT 4.1 


_ Can you list six 

already mentioned, the 
variation within a vear? 


products (goods or services) other than the ice cream 
demand for which you might expect to experience seasonal 


► A cyclical component (C), whereby the variable moves in a rather 
less regular cycle over the medium to longer term around the trend line 
For example some have claimed to observe a business cycle of some 
eight to ten years between periods of 'boom' and 'bust' in modern 
industrialised economies. Others have claimed to observe still longer 
cycles over 50 years, with the peaks of such cycles related to new 
technological breakthroughs, such as water power, steam, electricity, 
microelectronics, etc. These have sometimes been referred to as 
Kondratief cycles, after a Russian economist of that name. 

Note that the period of a cycle is often referred to as the time between suc¬ 
cessive peaks or successive troughs. 

j. 

► An irregular component (I), which is entirely unpredictable. The s tCK " 
market crash of 1987 was, for example, a dramatic and unexpected 
departure from trend in terms of share prices. 


PAUSE FOR THOUGHT 4.2 


Can you name three other irregular components which have influenced e 


wide range of UK producers in the past ten years? 


uk " 1 


Figure 4.1 presents two possible time series profiles for the value of a dep el 
variable (T). 
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figure 4.1 

some time series 
profiles 



in Figure 4 1(a) a clear upward trend is evident, around which there is 
In Figure w j re p r esenting successive peaks and quarter 3 

seasonal variatio , In Fi g ure 4.1(b) a clear downward trend is 

SUC rf around whTch there is a cyclical component with a period of around ten 
evident, a . peaks or successive troughs. 

years between succe > y ^ i51e for such profiles. We may, for example, 

Of course many ia [- on (e . g . quarterly or monthly) superimposed on 

have a shorter-term seas tren d using longer-term (e.g. annual) data, 

c timp series showing botn a cy 


W KNOW? . , ac o 

nstruction industry is widely regar 

Her for economic prospects sectors 

ny. Traditionally it is one one 0 , me 

iffected by economic '^°r demand when 
t sectors to experience nig 

lie recovery occurs. 


Additive or multiplicative model 

We can represent or 'model' our time 
series using the four components already 
identified. We can do this in one of two 
ways; either adding or multiplying the re¬ 
spective components. 
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figure 4.2 

Choosing an 
appropriate time 
series model 
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► Y = T + S + C + I Additive model 

► Y = T x 5 x C x / Multiplicative model 

As we shall see, we can use either of these models to calculate any trend line or 
seasonal variation. However there may be circumstances where one approach is 
more appropriate than another! 

As we see in Figure 4.2(a), the additive model is the most appropriate 
where deviations from the trend line are of a similar absolute magnitude from 
one peak (or trough) to another. This is shown in Figure 4.2(a) as occurring for 
short-term cycles (e.g. seasonal variations, with Q2 providing peaks and Q 1 
troughs). The same principle would equally apply if we were using annual data 
and comparing successive peaks or troughs in a business cycle over, say 1,1 
eight to ten year period. 

However as we see in Figure 4.2(b), the multiplicative model is the mo* 
appropriate where deviations from the trend line are of a similar percental 
one peak (or trough) to another. We can see that the absolute magnitude o ^ 
deviations from the trend line grows over time as the dependent varia 
rises, but that the percentage deviations remain roughly constant. 


moving average 

As we shall see in the next section, we can use the idea of a moving 1 ' 

. ^ * e trcnc * the data. To find the moving average we initia 

then IcTT ^ ut ithmetic mean) for a specified number of items n , 

a subset U, t ak tHat average havin 8 dropped the initial item of da ‘ 1 
a subsequent item of data. 

initiany h LTthe d sim a ? Ple bel ° W involvlng c l uarterly data f T t hen 

average along. P C average for four items of data, am 
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WORKED Find a four quarter centred moving average for the following data on sales value 

EXAMPLE 4.1 (£000): 


Solution 


(1) 


(2) 

(3) 

(4) 

(5) 

Year and 

Quarter 

Sales value 

4 quarter 

4 quarter 

Centred 4 quarter 



(£000) 

moving total 

moving average 

moving average 

1998 

1 

87.5 





2 

73.2 

314.0 

78.5 



3 

64.8 

316.8 

79.2 

78.9 


4 

88.5 

319.6 

79.9 

79.6 

1999 

1 

90.3 

324.0 

81.0 

80.5 


2 

76.0 

330.2 

82.6 

81.8 


3 

69.2 

333.8 

83.5 

83.1 


4 

94.7 

336.2 

84.1 

83.8 

2000 

1 

93.9 

339.0 

84.8 

84.5 


2 

78.4 

344.6 

86.2 

85.5 


3 

72.0 





4 

100.3 





It is often helpful to find the four quarter centred moving average in stages. 


► 


► 

► 


>ur quarter moving total (column 3). Here we simply sum the data for the 
itial four quarters in column (2) to find the four quarter moving total. Notice 
at the moving totals fall in between the actual quarterly data in column (2). 
r example the first moving total falls in between quarters 2 and 3 for 1998. 

_ Quarter moving average (column 4). We then divide the respective four 
■arter moving totals in column (3) by 4 to find the four quarter moving average. 

* Quarter centred moving average (column 5). It will help to align the 
! averages with the specific quarterly data in column (2). For this reason 
° centre the data in column (4) by summing respective pairs of data and 

t/idin£ bV 2. 


YOU KNOW? , mnuin a 

sy evidence suggests that the technique o 
B Ses is used by around 25 per cent o 
messes in an attempt to forecast future 


4.3 Finding the trend 


In practice we are often presented with 
monthly or quarterly data over a number 
of years for which no obvious business 







ns f0 r business and economics 

82 quantitative method resent We can therefore regard the m 

cycle (0 Of eight to ten -7* ^ four compo nents previously 

lata, V, as having only 
ie y = T■*- S + I (Additive) 

V TxSxI (Multiplicative) 

°' anoropriate moving average for the monthly or quart 

If we calculate an appi V the seasona i variation component (S) and th 
data, then we can damn b , data This will leave us wj(h h 

irregular component (/) from We 

required trend line. 

„,rina average in column (4) of the worked 

► Using a four hat * e can eliminate any seasonal variation (S) as 

example abov ters p or example in column (3) after our initial 

calculation of 314 we drop the first quarter of 1998 but add the first 
Quarter of 1999 to get 316.8. Our four quarter moving total which is the 
basis for the subsequent centred four quarter moving average in column 
(5) therefore continually takes in all quarters, both high (as for quarter 1) 
and low (as for quarter 3). In this way column (5) continually eliminates 
S, the seasonal variation component. 

► In calculating columns (4) and (5) we are averaging , and any averaging 
helps eliminate any irregular component. In this way column (5) helps 
eliminate I, the irregular component. 


If Y=T + S + I 


or Y = T x S x I 


then column (5), having eliminated 5 and J, leaves us with T, the trend line. 

Figure 4.3 plots the original quarterly data (F) of column (2) and the trend line 
(T) of column (5) for the worked example. 

Notice how we use the four quarter centred moving average for the trend 

wonM 1 f Se Strai ^f ^ nCS t0 connect successive points. An alternative approach 

Trr iine (see p - 52) which r: 

approach below (p. 87). 6 tIend ' We consider this alternative 


PAUSE FOR THOUGHT 4.3 


Can you name three products (on n w 

sales trend is well established and th °° S ^ Services ) for which an upwan 
trend is well established? ' ree pr °ducts for which a downward 


The nature of the data will 

eliminate .S' and / and therefore le^ ^ ^ ot movi ng average requ 

u , e onl y tll <? trend line, 7. 

Using moving avfira^ « 


Using moving averages t 0 find trend (yy 

► Quarterly d ata: US0 

► ZLTJZZ i, uarter ° en,rea moving average t0 elimire 

1 and ,eaVe r ^ ° entred movin g average ,o eliminate 
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figure 4.3 

Plotting the trend 
line 


SELF-CHECK 

QUESTION 



4.1 UK passenger movements by air are shown in the table below (data in 
100,000 passengers): 


Year 

Q1 

Q2 

Q3 

Q4 

1998 

44 

80 

120 

60 

1999 

52 

88 

126 

62 

2000 

60 

98 

140 

68 


Use an appropriate moving average to find the trend of the data. Plot the 
original data and your trend line on a graph. 

Note: An answer can be found on p. 334. 


4.4 Finding and eliminating the seasonal variation 


When the original data (Y) has no obvious longer-term business cycle in evid¬ 
ence, we have already noted that it can be represented as either: 


Y = T + S + 1 (additive model) 
or y = T x S x 1 (multiplicative model) 
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Finding the seasonal variation (S) 


We can easily extend our work in using moving averages to find f th 
component, so that we can find S, the seasonal variation component p 
ally we need only add an extra column to the calculations already Un( l 

to find T. This extra column which gives us S (and I) will be (Y - t\ in ! rtaken 

(yA ln the case 

of the additive model or |^-J in the case of the multiplicative model, f 0r th 

following reasons: 


Y = T + S + I (additive model) 


MA = T 
Y-MA = S ± I 

or Y = T x S x I (multiplicative model) 
MA = T 


As we shall see, a further simple averaging process will remove I, leaving 5, the 
seasonal variation which we require. 


WORKED We can demonstrate this approach by using the earlier calculations (p. 81) for 

EXAMPLE 4.2 sales value in Worked Example 4.1. Here we shall use the additive model. 


Solution 


table 4.1 

Time series of 
quarterly sales value 
(£ 000 ) 


Y 

(original data) 


1998 

Ql 

87.5 


Q2 

73.2 


Q3 

64.8 


Q 4 

88.5 

1999 

Ql 

90.3 


Q2 

76.0 


Q3 

69.2 


Q4 

94.7 

2000 

Ql 

93.9 


Q2 

78.4 


Q3 

72.0 


Q4 

100.3 


T 

Four quarter centred S + / 

moving average (Y - V 


78.9 -l 4 - 1 

79.6 8.9 

80.5 98 

81.8 -5.8 

83.1 -13.9 

83.8 1°- 9 

84.5 9 - 4 

85.5 - 7 - 1 
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Subtracting our trend values (T) from the corresponding quarterly sales values (Y) 
gives us S + / over successive quarters. The averaging process will then tend to 
remove /, leaving us with S as required. 

A simple table can usefully illustrate this process. 


table 4.2 

Finding the seasonal 


Ql 

Q2 

Q3 

Q4 

variation factors 
(£000) 

1998 

1999 

9.8 

-5.8 

-14.1 

-13.9 

8.9 

10.9 

2000 

Total 

9.4 

19.2 

-7.1 

-12.9 

-28.0 

19.8 


average (S) 

9.6 

-6.45 

-14 

9.9 


The averaging process for each quarter leaves us with S, the seasonal 
variation for that quarter. Clearly quarters 1 and 4 are high sales quarters for this 
product, whereas quarter 2 and especially quarter 3 are low sales quarters. 

Adjusted S 

The data shown in the ‘average’ row for S are actually unadjusted. If we sum the 
plus values we have +19.5 (9.6 + 9.9) but if we sum the negative values we have 
-20.45 (-6.45 + -14.0). In other words there is a net value for S of -0.95. 

Strictly speaking the plus and minus values should cancel out. We can 
compensate for the -0.95 by adding +0.95/4 (i.e. +0.24) to each of the four 
quarterly values for S. These values would then be the adjusted values for S. 


table 4.3 


Ql 

Q2 

Q3 

Q4 

rinaing me aajusieu 

seasonal variation 

factors (£000) 

Unadjusted S 
(0.95 net) 
Adjusted *S 
(zero net) 

+9.6 

+9.84 

-6.45 

-6.21 

-14.0 

-13.76 

+9.9 

+10.14 


+ 


°- 95 ) j.e. +0.24 added to each quarter 


it iq these adjusted values for S that we usually refer to as the seasonal variation 
components Here we can see that sales are normally +9.84 above trend for Ql. 
bur-6.21 below trend for Q2, and so on (all values in £000). 


Eliminating the seasonal variations (S) 

• * « the seasonal variation component, our next step is to eliminate 

Having foun , then have an estimate of Y without any 

resulting trom Lena, influences on Y. Such data with the seasona, 

vadah on Removed fs sometimes calied — ..sed 
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ACTIVITY 4 


table 4.4 

Eliminating the 
seasonal variation 



Y 

(original data) 

S 

(adjusted seasonal 
variation) 

from the original 

1998 

Ql 

87.5 

+9.84 

data (i.e. 


Q2 

73.2 

- 6.21 

deseasonalising 


Q3 

64.8 

-13.76 

the data) 


Q4 

88.5 

+10.14 


1999 

Ql 

90.3 

+ 9.84 



Q2 

76.0 

- 6.21 



Q3 

69.2 

-13.76 



Q4 

94.7 

+10.14 


2000 

Ql 

93.9 

+9.84 



Q2 

78.4 

- 6.21 



Q3 

72.0 

-13.76 



Q4 

100.3 

+10.14 


Y~ S 
(desea sonaiis e0 


data) 


77.66 

79.41 

78.56 

78.36 


80.46 

82.21 

82.96 

84.56 


84.06 

84.61 

85.76 

90.16 


Notice that in eliminating S by subtracting S from Y, we sometimes have 

' o.z — b.21 = 79.41. 


-+, as in Q2ofi998wr 


can easily itaSTtaT' variat '°n component, S, calculated above 
for Illustration T"" ^ 7 ' Here «>. 

multiplicative mode, we would "iZTiZV^Zl 


11 can be an involved process cal 

The spreadsheet (p. 467 ) D . . ° U atlng tbe components of time series da 
values of the respective ouaZTf for such calculations with th< 

and results carried out automata n ^ 9 entered in column D and the calcula 
e concept can be adapted for a Although the ,a yout is for quarterly da 
""• * and J! determine , he 1,2 “ me Seri6S data ' values 2 and t»c 
/ . ® layout and formulae fnr th ^ ° f decimal Places in selected column 

(P. 467). for the spreadsheet are shown in Figure A4.2 

As y°u will see, you arp . 

es,ima * e the Vend of th^ te ° hnidae ° f a duaner centred r 
also f S6d 8 tWelve month cent ^ d8ta ^ had the data been monthly, you 
forecasu^ "* sea on .T™ 8 aVeraga <° estimate trend,. You 

trend estimat^ W '" neea t0 subtract ^ t ° rS ' ' f y ° U are using your ,rend 

e to get a more am e seasona l factors from each quarto 
Se, "“te accurate forecast for that puarter. 


Set U P your spre d cast for that quarter. 

Year Q _ enter the quarterly data values show 

Q4 


Year 1 
Year 2 
Year 3 
Year 4 


T96.9 

324.1 

415.2 
513.6 


295.5 
418.0 

528.6 
550.8 


349.4 

447.5 

550.6 

670.6 


389.3 

456.4 

615.3 

754.4 












time series and forecasting 


► cLT nt on your 

► RepeaUhe procedure usin^thTT 56 °" P ' 466 

and Review questions involving moving r avereir' UeS in the Var '° US Sel, - check 


^S Forecastmg: time series 


We have already looker! at „ a- . 

analysis of Chapter 3 ( p (forecastin §) usi ng the simple regression 

line (line of 'best fit' ^"^,"7?" ““ ° f fl " din * ■ >« TO 

extrapolate (take forward) the regresdn ■ \ Serles “mponents. We can then 
to derive a forecast of hr,‘re safes '** ^ ““ trend eStimates 


Forecasting: trend value only 

the earlier* scaTto magrems 5 ^ F^LTa‘p M) mT'd'T eS “ mateS ^ 

“clS ~ Z a Tr ^ S 

values. “ be USed fol predkti ° n (forecasting) of future trend 

We must initially calculate the regression line for the eight trend « Hm ,t 
shown in Figure 4.3 and Table 4.1. We can use either of tL , d estlmates 
considered in Chapter 3 for calculating a regression line Here° appr ° ach . es 
'original data' formula (p. 53), with the dependent variable y standTngfo'r th* 
trend value and the independent variable X standing for time (quartefs) With 
trend estimates available for eight successive quarters, we let X = 1 for the^S 
available quarter for a trend estimate (Q2, 1998), rising to X = 8 for the eighth 
available quarter for a trend estimate (Q3, 2000). S 

From Table 4.1 above (p. 84) we can then construct a new table allowing 
the application of our regression formula: 6 


X Y XY x 2 

(quarter of data) (trend value) 


1 

2 

3 

4 

5 

6 

7 

8 

IX = 36 1Y = 


78.9 

78.9 

79.6 

159.2 

80.5 

241.5 

81.8 

327.2 

83.1 

415.5 

83.8 

502.8 

84.5 

591.5 

85.5 

684.0 

657.7 

I XY = 3,000.6 


1 

6,225.2 

4 

6,336.2 

i) 

6,480.3 

16 

6,691.2 

25 

6,905.6 

36 

7,022.4 

49 

7,140.3 

64 

7,310.3 

= 204 

IV 2 = 54,111.5 













88 


QUANTITATIVE METHODS FOR BUSINESS AND ECONOMICS 


X 


36 =4.5 Y = = 82.2 


8 


8 


where m = 


i.e. 


m = 


m = 


m = 


We noted (p. 53) that, using original data, the formula for the re 
squares) line is: §ressi °n 

Y = mX + c 

n^XY - 

" X * 2 - ( X *) 2 

and c=Y - mX 

where n = the number of observations 

8(3,000.6) - (36)(657.7) 

8(204) - (36) 2 

24,004.8 - 23,677.2 

~1,632 - 1,296 

327.6 
336 

m = +0.975 

c = Y-mX 
c = 82.2 - 0.975(4.5) 
c = 77.S 

■£z ^ 0.975X + 77 a 
where Y T = trend sales value (£000) 

Wecan^rir^^ 1998 ^ 

forecast toure trend ' 

line into the fm. re gression line w 

The dashed part of ® d and extrapolates the 

We caifuseth' 5 eXtended beyond that the extra P ola,ion 0 

Figure 4.4 that- 6 regressi °n line for for*. °* Whlch data are currently avai 
when X ° re casting. For example we can see 

„ ~ 12 <l e ' (J2 ' 2001) 

0.975(12) + 77 8 

. r = 1 T7 + 77.8 

n *jasijpi d 


Oe* 


St 


i.e 




o * - & 

0 . 975 ( 20 , + 778 
p . = 19.5 + 77 8 


(U - % 2003 ) 
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figure 4.4 

Using the regression 
analysis to find the 
trend line and using 
that line for 
forecasting 


Of course, the better the fit of our regression line to the existing trend data, the 
more confident we can be in extrapolating the line to make forecasts in the future. 

This brings us back to the coefficients of determination and correlation 
(Chapter 3, pp. 58-68). In this particular case there is an extremely good fit 
for our trend regression line. 

Using original data 


R 2 = 


nZXY - 


i.e. R z = 


[«X * 2 - (X x f] x - E 52 ) 2 

8(3,000.6) - (36)(657.7) 


l2 


^8(20^30^x^(54,111.5) - (657.7) 2 ] 


2 


24,004.8 - 23,677.2 _ 

■^^^296^32,892.() - 432,569.3] 



327.6 

2 

327.6 I 2 

R 2 = 

7336 x 322.7 J 


329.3 


^2^j^oenicUdlLoi-rniinatk .)Ojm 

LLOlL =- ( ^ 99 7 
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oimPQ c and economics 
quantitative methods for bus 

. TTOfi 5 


ETHUUO rw.x- 

«. n f the total variation explained or 'accounts 
With over 99 per ce can have considerable confidence J f ° r ' by 

our regression line from our regression line. * f < 

ing into the future y 

Forecasting: trend and seasonal factors 

Of course any forecast for the sales in a particular quarter will be more 

into account the seasonal vanaUon S, components already ^ 
the trend. The adjusted values for S were as follows (£000): ® 


if it takes 
as well as 


■ate 

ted 


m t±Q ra\ 02 (-6.21) Q3 (-13.76) Q4 (+10 




We must add or subtract these seasonal variation values to our trend forecast if 
we are to be more realistic about the likely future sales in a particular quarttI 
For example, we can forecast the trend using the equation 


Y T = 0.975X + 77.8 
where X = 0 for Q2, 1998. 

This gives the trend forecast for Q2, 2001 as follows: 
► when X = 12 (i.e. Q2, 2001) 

Y T = 0.975 (12) + 77.8 
Yj = 89.5 

However Q2 also has a seasonal component 
5 = -6.21 


So our forecast for ? T+S with both T and S included is: 
F T+ s = 89.5 - 6.21 
T t+s = 83.29 (£000) 


Table 4.5 shows the forecast (?) 
2003 inclusive. 


for sales value (£000) over the period 2001- 


table 4.5 

Forecasts of Y r+S 
including trend ( T) 
and seasonal 
variation (S) 
components. 
Additive model with 
least squares line 
fr = 0.975X + 77.8 
for trend forecast 
(Q2, 1998 = 0). 


Year and 
quarter 


X* 


(Y t ) 

Trend forecast 


2001 


2002 


2003 


Ql 

11 

Q2 

12 

Q3 

13 

Q4 

14 

Ql 

15 

Q2 

16 

Q3 

17 

Q4 

18 

Ql 

19 

Q2 

20 

Q3 

21 

Q4 

etc. 

22 


88.5 

89.5 

90.5 

91.5 

92.4 

93.4 

94.4 

95.4 

96.4 

97.3 

98.3 

99.3 


(S) 

Seasonal variation 

+9.84 

- 6.21 

-13.76 

+10.14 

+9.84 

- 6.21 

-13.76 

+10.14 

+9.84 

- 6.21 

-13.76 

+10.14 


Total forec® 1 


98.34 

83.29 

76.74 

101-64 

102.24 

87.19 

80-64 


105.54 

106.24 

91.09 

84.54 

109.44 
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SELF-CHECK 

QUESTION 


4.2 The data below show UK sales of a particular model of car over three years 
(figures in 100,000 units). 


Quarter 

1 

2 

3 

4 

1998 

66 

106 

140 

82 

1999 

73 

119 

165 

91 

2000 

85 

130 

205 

100 


(a) Use the technique of moving averages to find values for the trend 
component (T). 

(b) Plot the original data and your trend values on a scatter diagram. 

(c) Find a ‘least squares line’ to best fit your trend values. 

(d) Estimate the season variation component (S). 

(e) Forecast future sales, taking both trend and seasonal variation into 
account in: 

(i) Quarter 1, year 2003 

(ii) Quarter 3, year 2003 

(f) How confident can you be in these forecasts? 

Note: An answer can be found on p. 335. 


4.6 Forecasting: exponential smoothing 


A widely used technique in business forecasting is that of exponential smooth¬ 
ing. The idea here is that the forecast at time t for the next time period (t + 1) 
should take into account the observed error in the forecast made for t in the 
previous time period (t - 1). 

Clearly such 'exponential smoothing' implies a learning process, whereby 
future forecasts are continually revised (smoothed) in the light of previous 
experience. Strictly speaking, this approach is most appropriate when there is 
little or no trend (T) in the data and little or no seasonal variation (S). It is best 
used for short-term forecasting, for example forecasting the outcome of the 
next time period. 

We can express this approach as follows: 


Next forecast = Previous forecast + Some proportion of the previous 

forecasting error 


= Y, + a(Y,- ?,) 

where Y, = actual data (observed outcome) in time period t. 

Y Y ' = forecast data for next time period (t or t + I) made in previous 
time period (t - 1 or t respectively) 
a = smoothing constant 
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Put another way 

? M =y, + «( £ <) 

where E t = error term = Y t -f, 

The value assigned to a, the smoothing constant, can vary b etWepn 
. n 2et 0 

and one 

i.e. 0 < a < 1 

We can usefully consider the extreme values of a by way of illustration. 

► When a = 0, no adjustment is made for the previous forecasting error 
The next forecast is then assumed to be the same as the previous forecast 

► When o=l, full adjustment is made for the previous forecasting error. 
The next forecast is then the previous forecast ± the entire amount of any 
previous forecasting error. 

In practice a can be derived experimentally, based on the average size of the 
error term in previous forecasts. Values of between 0.1 and 0.3 are typically 
assigned to a. 

In the Table 4.6 below we continue this exponential forecast into other 
quarters, based on the actual values of Y (quarterly sales) shown (000 units) 


table 4.6 

Exponential forecast 

of future demand 

(000 units), a = 0.2 

Quarter 

Y t 

(actual sales) 

Y t 

(forecast sales) 

E t 

(Y, ~ ? t ) 

1 

96 




2 

102 

84 

18 


3 

104 

87.6 

16.4 


4 

100 

90.9 

9.1 


5 

88 

92.7 

-4.7 


6 

92 

91.8 

0.2 


7 

96 

91.8 

4.2 


8 

94 

92.6 

1.4 


9 

90 

92.9 

-2.9 


10 

96 

92.3 

3.7 


WORKED 
EXAMPLE 4.3 


A firm is at the end of quarter 2. At the end of quarter 1 it had forecast that 
demand for its product in quarter 2 would be 84,000 units, only to find that a 
emand turned out to be higher at 102,000 units. Provide an exponential for 
for demand in quarter 3, using a = 0.2. 


Solution 

Y 2 - 102,000 (actual demand Q2) 

Y 2 - 84,000 (forecast demand Q2) 

Here t, the current time period. Is quarter 2 and a is 0.2 
Y t+ i - Y, + a(Y t - y,) 
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i.e. Y 2+1 - Y 2 + 0.2 (Y 2 - Y 2 ) 

Y 3 = 84,000 + 0.2 (102,000 - 84,000) 

Y 3 = 84,000 + 0.2 (18,000) 

Y 3 = 84,000 + 3,600 
Y 3 = 87,600 

The impact of this exponential smoothing with a = 0.2 on forecast sales is shown 
in Figure 4.5. 


figure 4.5 

Forecasting future 
sales using 
exponential 
smoothing and 
different values for a 



Different values of a 

In Table 4.6 we assumed the 'smoothing constant', a, to be 0.2. We have already 
looked at the impact of extreme values (a = 0, a = 1) of a on exponential 
smoothing. In general we can say the following. 

► When a is close to 0, only a small proportion of the latest error (£,) will 
be included in the next forecast (? f+1 ). Clearly that next forecast will not 
differ greatly from the previous forecast. 

► When a is close to 1, a large proportion of the latest error (£,) will be 
included in the next forecast (f, +1 ). Clearly that next forecast will differ 
greatly from the previous forecast. 

As we can see from Figure 4.5, using a = 0.2 gives a smoother sequence of 
forecasts than would occur using a = 0.4. This is because the a = 0.2 sequence 
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THODS FOR 

£ V ic less influenced by any actual observation 

of forecasts for K (+ , is ies> 

^L««nrp from its forecast value {Y t ). 




Choosing a value for a 

It would seem reasonable to choose a value of a which, on previous expe rtence 
would have produced the 'best’ forecast using exponential smoothing. 'Best' in 
This context would arguably be the value of « which gave a smaller se quence £ 

errors (£ t ), where E t =Y t - Y t . 


ars (h t ), wnere c t - * t * r . 

Two nossible ways of evaluating previous forecasts 




Mean absolute error (MAE) 

Here we select that value of a which, on previous experience, minimises the 
average of the absolute errors (ignoring sign) of deviations between actual (K t ) 
and forecast (? t ) values: 


lie,I 

MAE = —— 
n 

where | | is modulus, i.e. ignore signs 

n is number of error terms (Y t -Y t ) 

Notice that n, the number of error terms, will be one less than the number of 
items of data since we do not have a forecast and therefore error term for the 
first item of data. 


Mean squared error (MSE) 

Here we select that value of a which, on previous experience, minimises the 
average of the squares of the errors. We have no sign problem here, as all 
squared values are positive: 


MSE = 



where n is number of 


error terms (Y t -Y t ) 


WORKED 
EXAMPLE 4.4 


Use the data of Table 4 a 

respectively. Which vai ! provide forecasts using a = 0.2, 0.3 an 

ue 01 would you select for future forecasts? 


Solution 


already plottpH 

formally work out MAE a h 8ta for 01 ~ 0,2 and °- 4 on Y\gure 4.5 
fen items of data. ™ MSE for the nine error terms (E t ) based 
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Review 

qu estions 


Quarter 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 



a = 

0.2 


a = 

0.3 


a = 

0.4 


y , 

E t 


Ef 

E t 


Ef 

E t 


El 

96 










102 

18.0 


324.0 

18.0 


324.0 

18.0 


324.0 

104 

16.4 


269.0 

14.6 


213.2 

12.8 


163.8 

100 

9.1 


82.8 

6.2 


38.4 

3.7 


13.7 

88 

- 4.7 


22.1 

- 7.6 


57.8 

- 9.8 


96.0 

92 

0.2 


0.0 

- 1.4 


2.0 

- 1.9 


3.6 

96 

4.2 


17.6 

3.1 


9.6 

2.9 


8.4 

94 

1.4 


2.0 

0.1 


0.0 

- 0.3 


0.1 

90 

- 2.9 


8.4 

- 3.9 


15.2 

- 4.2 


17.6 

96 

3.7 


13.7 

3.3 


10.9 

3.5 


12.3 


X |£ f | = 60.6 

I Ef = 

739.6 

Z|£i| = 58.2 

1E? = 

671.1 

1151 = 57.1 

ZE ? = 

639.5 


Note. Although there are ten items of data there are only nine error terms, hence n = 9. 

MAE = 


n 


(a) a = 0.2 = = 6.73 

9 

58 2 

(b) a = 0.3 = —- = 6.47 

9 

57.1 

(c) a = 0.4 =-= 6.34 


MSE = 


I£l 

n 


(a) a =0.2 = = 82.18 


_ 671.1 

(b) a =0.3= — = 74.57 

(c) a = 0.4 = ——— = 71.06 


In this particular example the value a = 0.4 will give the minimum value for 
both mean absolute error (MAE) and mean square error (MSE). It follows that 
our previous experiences of forecasting would arguably lead to the choice of 
a = 0.4 for future forecasts when choosing between the possible values of 
a = 0.2, 0.3 or 0.4. 


4.1 As a result of local government cutbacks some branch libraries are threat¬ 
ened, with closure. A newly appointed chief librarian claims that a certain 
branch library has a declining patronage, based on his observations of the 
last three months. The head librarian of the branch disagrees with the 
analysis and quotes quarterly data recorded over the past three years for 
book issues to support his case. That data are as follows: 
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Year 


business and 

economics 


Q1 

Q2 

Q3 

Q4 

2,542 

2,766 

2,944 

2,826 

2,905 

3,140 

2,991 

3,048 

3,333 

2,644 

3,137 

3,125 


'TTTT^Tline using the moving average method. 

4 ' 2 ^ is in “ f ^ ” n „ 

fte company and has been provided with the following quarterly sales 6gW( „ 

Unit salts 1997-2000 (OOOs) 


Year 

1 

2 

3 

4 


Q1 

Q2 

Q3 

Q4 

441.1 

397.7 

396.1 

472.8 

476.4 

454.4 

450.8 

553.5 

580.7 

573.2 

571.6 

703.6 

692.0 

676.5 

659.9 

752.8 


(a) Calculate the trend line using the moving average method. 

(b) Plot the original data and the trend values on a scatter diagram. 

(c) Estimate and then eliminate the season variation component. In 
other words find the deseasonalised data. 


4.3 A company producing torches is negotiating with a company for the supply 
of torch bulbs. The bulb company therefore needs to plan its production 
to meet the needs of the torch company and thus uses that company's 
quarterly sales figures over the past three years to forecast future demand. 
The sales figures are as follows: 


Quarterly sales figures 1998-2000 (OOOs) 


Year 

Q1 

Q2 

Q3 

Q4 

1 

349.4 

295.5 

196.9 

389.3 

2 

447.5 

418 

324.1 

456.4 

3 

550.6 

528.6 

415.2 

615.3 


season movin§ avera ge technique find the trend observations and th 
observational T° n factorS- Fit a least squares regression line to the h 
sonal variar f ^ trend re § ression line and your estimates of the j 
year 4 f or torc^^ 0 ^ t0 forecast future demand for the four quarte - 


for the value of ^ a ^ ei °f a company is responsible for providing a 

now to make However ' s he has been taken ill and the dep 

_ . KL llle forecast for tu« .... . .*sai snv 


now to make the*f ^» 11()wevei ' s * le has been taken ill and the < 
method, favoured hvTi^ f ° r the next m °nth. The exponential 
believes that an « G f o 3 C ° mpany ' llses an « of 0.2. However, the 
data provided fomra** * S 3 more realistic smoothing constant. 

' 0recast tot 'he next month using each smoothing a 
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t 

Y 

1 

3.85 

2 

3.22 

3 

3.18 

4 

3.61 

5 

3.62 

6 

3.48 

7 

3.53 

8 

3.62 

9 

3.36 

10 

3.58 

11 

3.62 

12 

3.24 


Calculate the mean absolute error and the mean square error for each 
smoo mg constant (assume an initial forecast for t=l of 3.49). Which 
smoothing constant should be recommended based on the criterion of 
minimising the mean square error? 

Note: Answers can be found on pp. 402-415. 


Further study and data 
Texts 

Bancroft, G. and O'Sullivan, G. (1993), Quantitative methods for accounting and 
business studies , 3rd edn, McGraw Hill, chapter 10. 

Curwin, J. and Slater, R. (1996), Quantitative methods for business decisions, 4th 
edn, International Thompson Business Press, chapter 20. 

Lawson, M., Hubbard, S. and Pugh, P. (1995), Maths and statistics for business, 
Addison Wesley Longman, chapter 11. 

Morris, C. (1999), Quantitative approaches in business studies, 5th edn, Pitman, 
chapter 16. 

Oakshott, L. (1996), Essential elements of quantitative methods, BPP, chapter 3. 

Swift, L. (2001), Quantitative methods for business, management and finance, 
Palgrave Publishers, part S. 

Thomas, R. (1997), Quantitative methods for business studies, Prentice Hall, 
chapter 6. 

Waters, D. (1997), Quantitative methods for business, 2nd edn, Addison Wesley 
Longman, chapter 9. 

Wisniewski, M. with Stead, R. (1996), Foundation quantitative methods for 
business, Pitman, chapter 13. 

Sources of information and data 

See the list at the end of Chapter 1 (p. 25). 


On-line exercises 


Check the web site www.booksites.net/burton to find extra questions, Spreadsheet 
and on-line exercises for this chapter. 







CHAPTER 


probability 


Objectives 


Introduction 


When you have read this chapter you 
should be able to: 

► distinguish between experimental an 
theoretical probability; 

► apply the ‘AND’ rule to independent 
events but the ‘OR’ rule to mutually 
exclusive events; 

► use Venn diagrams to solve problems 
where events are not mutually 
exclusive; 

► solve problems involving conditional 
probability, including those where Bayes 
theorem can be used; 

► use decision tree analysis to show all 
the probabilities in complex situations, 
often involving conditional probabilities; 

► identify situations in which probability 
calculations can aid the decision-making 
process, as in the case of expected 
values; 

► calculate probabilities involving 


0 lnnk at the basis for calculating the 
particular events, ore™, 
probabilities snorted to occur. In fact 


probability be expected t0 occur. In fact 

“Th Htie together with the probability distributions 
probabilities 8 6 and 7/ are the basis for many 

we consider i dedsions . For example insurance 

economic an ibIe future events is only pos- 

siWebecause'actuaries have worked out, often using pas. 
data the probabilities of those events occurring, ese 

probabilities then determine ^ insurance presto 

be charged for cover against such even . 
circumstances change, and with them the associate 
probabilities of such events occurring, then insura 
premiums will have to rise or fall. The greater like 1 oo 
of burglary has, for instance, caused a rise in premium 
for insuring house contents against theft. 

In this chapter we consider the basis for P r 
ity calculations involving both mutually exC uSI 
('OR' rule) and independent ('AND' rule) events.^ 

. ...... .„, (nll(i „. fact some events are neither mutually exclusive nor 

calculate probabilities involving , , . . , . , , , io(iranl C an help 

.... f . dependent, in which case the Venn diagram ^ 
permutations or combinations of events. * ' L>1 1 “ ° nI1 to eon- 

H us solve probability problems. We then move on it ^ 

sider conditional probabilities, as in the ca s 

Bayes theorem. Decision tree analysis is seen as being particularly help 1 
showing all the probabilities. The idea of expected value is also explored; 1 
applied to practical decision-making situations, using decision tree analy* 
some cases. The chapter concludes by looking at permutations and <» 
binations of events, which is the background to work on the Binomial»' 
Poisson distributions in Chapter 6 

thou« n ht W and!° ' Se ' f ‘ d,eCk < l uestions '- responses to each W* £ 
b00k 8 “ ‘° ,he ''^i™ questions' can be found at the end ot 
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5.1 Probability calculations 


Experimental probability 

This can be found by performing an experiment many times and keeping an 
accurate record of the results. The experimental probability of a particular event 
happening can then be worked out as follows: 


Experimental 


probability = 


number of times the event has happened 
total number of possible occurrences 


DID YOU KNOW? 

Experimental probabilities are worked out for all types 
of ‘hazards’. For example the ‘risk’ of dying from 
occupational exposure to asbestos has been worked out 
as 3 per 1,000 persons exposed. Actuaries work out 
experimental probabilities all the time in order to 
calculate insurance premiums for road accidents by age 
and other personal characteristics. 


Example 

When a normal dice was rolled 100 times, 
the number five was actually rolled 19 times. 
This gives an experimental probability ratio 


of 


19 

100 


0.19. 


Theoretical probability 

This is found by considering equally likely 
events. The theoretical probability is found 
by using the following ratio. 


Theoretical probability 

number of ways an event can occur 
total number of different equally likely events that can occur 


Examples 


1 


Rolling a dice and getting an even number has theoretical probability 


3 

6 


1 

2 


2 Rolling a dice and getting a number 5 has theoretical probability 

I = 0.17 (2 d.p) 

6 

Of course if an event is impossible , then its theoretical probability is 0. On the 
other hand, if the event is certain , then its theoretical probability is 1. 

Notice that if we know the theoretical probability of an event occurring, 
then we also know the theoretical probability of it not occurring. 


If the probability of an event occurring is, say, p then the probability of the 
event not occurring is 1 - P- 


Example 

The theoretical probability 


of rolling a 5 on a dice - 


l 

6 


The theoretical probability of not rolling a 5 


on a dice = 


1 


6 


5 

b 
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PAUSE FOR THOUGHT 5.1 


Can you work out the theoretical probabilities for three different 


e W?nt s? 


SELF-CHECK 

QUESTIONS 


figure 5.1 


5.1 Write down the theoretical probability of the following events: 

(a) rolling a dice and getting a 2 

(b) rolling a dice and not getting a 3 

(c) rolling a dice and getting an odd number 

(d) rolling a dice and getting a 2 or a 3 

(e) drawing an ace from a pack of cards 

(f) drawing a card from a pack of cards and not getting a diamond 

(g) being born on 31 April 

(h) rolling a dice and getting a number less than 7 

(i) buying a box of matches with an even number of matches in it 
(no quality control) 

(j) picking a blue ball from a bag containing two blue and seven white balls 

(k) picking a ball that is not blue from the above bag 

(l) meeting someone whose birthday is not on a Saturday. 

5.2 A bag contains 20 marbles, all the same size. Ten are blue, five are white, 
three are yellow and two are black. What is the theoretical probability of 
choosing at random 

(a) a blue marble (b) a black marble 

(c) a white marble (d) a marble that is yellow or black 

(e) a marble that is not white 

5.3 Two unbiased dice are thrown. Their scores are added to make a total 
between 2 and 12. 

(a) There are 36 possible totals as indicated in Figure 5.1 
Copy and complete the table. 


6 


5 


0) 

o 

XD 

a; 

c 

o 


2 


1 


4 

3 


( 1 . 6 ) 

(1,5) 

(1,4) 

(1,3) 


(1,2) 

(2,2) 

(3,2) 

(1,1) 

(2,1) 

(3,1) (4,1) (5,1) (6,1) 


3 4 

other dice 


1 


2 


5 


6 
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(b) Write down the theoretical probability of throwing a total of 

(0 2 (ii) 3 (iii) 4 

(iv) 6 (v) 7 (vi) 8 

(vii) 10 (viii) 12 

(c) What is the theoretical probability of not getting a total of 7? 

Note: Answers can be found on p. 338. 

5.2 Mutually exclusive events: OR rule 

If one event excludes the possibility of another event happening, then we say 
that the two events are mutually exclusive 

Example (i) tossing a coin and getting a head and a tail; 

(ii) rolling a dice once and getting a 2 and a 3. 

Events that are mutually exclusive can have their probabilities added to find the 
probability of one OR the other happening. 

OR rule: Mutually Exclusive Events 

For two mutually exclusive events A and B, the probability of A OR B occurring 
can be written as: 


P(A) OR P(B) = P(A) + P(B) 


Example 

If I roll a dice what is the probability that I roll a three or four? 


1 

Probability of rolling a three - ^ 


1 


Probability of rolling a four - 6 


, 1 1 _ 2 _ 1 
The probability of rolling a three OR a four = - + - - -g - j 


- + - = - 


WORKED 
EXAMPLE 5.1 



t experience to work out the probabilities of a particular 
over the year. His findings for ‘events’ A to E and 
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Solution 


t0 E ar e mutually exclusive, if one happens the others do not: 
= p(A) + P(B) = °- 2 + 0,3 = 0,5 

r R) = 1 - = 0-5 

Ip,D) + P(E) = 0.25 ♦ 0.15 = 0.40 


The events A to E are rr 

^ p(A or B) = P(A) + p 

(ii) i - P(A or B) = 1 " 

(iii) P(D or E) = P(D) + F 



identify three different situations involving mutually exclusive 


5.3 Venn diagrams: events 

Sometimes events can occur at the same time, so that they are not mutually 
exclusive. In solving this type of problem it can help to make use of a Venn 

diagram. 

Suppose, for example, that we have a pack of 52 cards and that event A is 
drawing a club and event B drawing a picture card. We now want to workout 
the probability of a card drawn at random being either a club or a picture card, 
i.e. P(A OR B). A pack of 52 cards will have 13 clubs and 12 picture cards. 

In this case we cannot use the OR rule for mutually exclusive events since 
here a club could itself be a picture card, i.e. events A and B are not mutually 
exclusive. Of the 13 club cards in the pack, the Jack, Queen and King of clubs 
are picture cards. There is therefore a danger of double counting if we use the 
conventional OR rule for mutually exclusive events. The correct answer here is: 

13 12 3 22 

P (A OR B) = — + — - — = — 


52 52 52 52 


OR rule: Non-Mutually Exclusive Events 

P(A OR B) = P(A) + P(B) - P(AB) 

The Venn diagram (Figure 5.2) can be useful in solving problems involving 
ts that are not mutually exclusive, i.e. which 'overlap* in some way^ 


figure 5.2 

Venn diagram for 
pack of cards, where 


A 


B 


A - club card and 
8 = picture card 
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SELF-CHECK 

QUESTIONS 


6 C3n n ° te the lowing from Figure 5.2. 

^ Th .0 ° 

sample, here the^n-ff 6 Corresponds to al1 th e possible events in the 

► The circle A m 1 eient cards which could be drawn from the pack. 

► The c c e R C ° rreSp0nds to the 13 club cards. 

Clrcle B corresponds to the 12 picture cards. 

the club cardf Weer V he tW ° CirdeS corres P onds to the fact that three of 
the club cards can themselves be picture cards. 

a Venn diagram k is essential that you first consider the area 
in Figure s? th ^ further - There fore enter the 3 in the area of overlap 
anrl Q in en . in t ^ le remain der of the circle A representing club cards 

nrohahii't ! remainder of the circle B representing picture cards. The required 
f Y i hat a card drawn at random from the pack is a club OR a picture 

,. ° H . am ^ d by addin § U P these numbers, namely 10 + 3 + 9 = 22, and then 
ivi mg y the total sample of 52 to obtain the result (22/52) already outlined. 


5.4 A company puts in two separate bids, A and B, for a particular contract. 

The probability that it will obtain the contract by bid A is 0.3, but 0.5 that 
it will obtain the contract by bid B. What is the probability that it will 
obtain the contract by bid A or bid B? 

5.5 A company usually recruits 25 management trainees per annum. In 2001, 
140 applications were received and of these: 

78 had previous work experience 
43 had passed a vocational exam 

21 had both work experience and had passed a vocational exam (and had 
been included in the above figures) 

(a) Use a Venn diagram to illustrate this situation. 

(b) What is the probability that an applicant selected at random had 
previous work experience or had passed a vocational exam? 

Note: Answers can be found on pp. 338-339. 

5.4 Independent events: AND rule 

If one event has no effect on another event then we say that the events are 
independent. To find the probability of BOTH events happening (one event 
AND then the other) we multiply their probabilities. 

FromTnormal pack of cards, what is the probability of drawing an Ace, putting 
it back and then drawing a King? 

JL = — 

Probability of drawing an Ace ^ , 3 

im¬ 
probability of drawing a King 52 13 
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WORKED 
EXAMPLE 5.2 


SELF-CHECK 

QUESTIONS 


methods for business and economics 

Since we have independent events, we multiply their individual probab . 

1 1 J_ 

1,e * 13 X 13 169 

Events that are independent can have their probabilities multiplied t 0 fi n 
probability of one AND the other happening. n ° 


AND rule 

For two independent events A and B, the probability of A AND B occurring can be 
written as: 

P(A) AND P(B) = P(A) x P(B) 


Each one of three separate boxes has 100 electrical components in it. The first 
box has six faulty items, the second 8 and the third 5. One component is taken 
from each box. What is the probability that: 

(a) all three components are faulty 

(b) all three components are good? 

Solution 

Selecting a component from each box is an independent event. Let us call 
selecting a component from the respective boxes as events A, B and C. 

(a) Here we let P represent the probability of selecting a faulty component. 

P(A AND B AND C) = P(A) x P(B) x P(C) 

= 0.06 x 0.08 x 0.05 
P(A AND B AND C) = 0.000240 

In other words the probability of selecting three faulty components is 0.024 
per cent or 0.000240 as a decimal. 

(b) Here we let P represent the probability of selecting a good component 

p (A AN D B AND C) = P(A) x P(B) x P(C) 

= 0.94 x 0.92 x 0.95 
P(A AND B AND C) = 0.82156 

other words the probability of selecting three good components is 
per cent or 0.82156 as a decimal. 


Drobahilit* is made U P °f five Independently produced components. J 
Drobabiii/ ° any 006 com P onen * being defective Is 0.03. What Is 
y of all five components being ‘good’ (l.e. of none being de 

The probaWNt Ch ? the Probability of a seed germinating '* ^ 

r:: ,,ower —-*-- - -js? 

flower? 6 P r °b a billty that any one seed will re9 
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58 Two school fri enrt 

the probability ,h * 8 Ways do their best at h 

that Conrad getsThe 6 * *** Prob,ems rightIsT/T*’ bUt ** mathema «<« 
eetS tha problems right is 9/10 ^ ’ Wh,te * he Probability 

X s 2:■— ** '• - * “ 

<«) neither gets”? Wr ° ng 
*/ * P r °blem wrong? 

/V ° te; An swers can be found 

Tound on p. 339 


5.5 CoMitiona. proba b |„ ty 

Here we look cit 

-iSK a d evem spends 

lonal probability. already occurred. We call this 

^ome events arp • / 

if they do happen, they influencethe^n independent - However 
happening. For example, if we purchase a^hea 7 1 °‘ her ' subse( I uent ' events 
then the probability of its requiring • heaper ' lower quality machine tool 

than if we had selected a more expeLTlheTa* 1 ^ WU1 be hi S h « 

words we are looking at dependent events invZ q machine too1 -In other 
probability. ts mvol ™g situations of conditional 

It will help to become familiar with some terminology: 


P(A) = Probability of event A 
P(BIA) = Probability of event B, given that 
P(CIAB) = Probability of event C, given that 
occurred, etc. 


event A has already occurred 
events A and B have already 


The contrast between independent events (AND rule) and dependent events (con 
ditional probability) can be illustrated in the following example involving draw¬ 
ing coloured balls from a bag. When we replace the ball selected on the first 
draw before we draw the second ball, then the second draw is independent of the 
first. However when we do not replace the first ball before we draw the second 
ball, then the second draw is dependent on the first. 


A bag contains 40 balls, of which 16 are red and 24 are blue. Three balls are 
drawn separately from the bag. What is the probability of drawing three red balls: 

(a) when each ball drawn is replaced before the next draw? 

(b) when each ball drawn is not replaced before the next draw? 
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Let A = getting a red ball on first draw 
Let B = getting a red ball on second draw 
Let C = getting a red ball on third draw 

P(ABC) = P(A) x P(B) x P(C) 

16 

where P(A) = P(B) = P(C) = — = 0-4 
So P(ABC) = 0.4 x 0.4 x 0.4 = 0.064 


In other words, with replacement, there is a 6.4 per cent or 0.064 
selecting three successive red balls from the bag. 


Probability of 


(b) Here we do not have replacement, so the probabilities of selecting sub 
red balls are dependent on whether a previous red ball has been selected 5 ^ 6 


nt 


P(ABC) = 


16 

15 

14 

- X - 

X - = 

40 

39 

38 


3,360 

59,280 


= 0.057 (to 3 d.p.) 


In other words, without replacement, there is a 5.7 per cent or 0.057 probability 
of selecting three successive red balls from the bag. 


Bayes ' theorem 


thporpn^ r\v° n ltiona ^ P r °t )a bility can be expressed more formally using Bap' 

be pvn a Stat f S ^ at the probability of two dependent events occurring can 

conditional ^u e probabilit y of the first event occurring multiplied by the 

already orr ^ 1 ^ ° f the second even t occurring given that the first has 
already occurred (and so on for more than two events) 


Where A and B are dependent events 
P(A AND B) = P(A) x (P(BIA) 
ut p ( B AND A) = P(B) X P(A|B) 
l e ‘ P(A) x P ( fi |A) = P(B) x P(A|B) 

We can express this result as follows: 


_ p (B) x P(AIBi 

~~Ra) 

‘• e - p (B given A) = ^_AND_A) 

a, RA) 

tematively We Can express ^ ^ 

P(AIB) = P ( A i^P(BIA) 
p (B) 

i>e - p (A given B) = P (AAND_B) 

P(B) 
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This result is often referred to as Bayes' theorem and is widely used in calcula¬ 
tions for problems involving conditional probabilities. 


40 workers in a large office can be classified as follows: 


Clerical grade Administrative grade 


Male 4 6 

Female 22 8 


(a) If a worker is selected at random from the office, what is the probability that 
the worker will be in the administrative grade? 

(b) If the worker selected is female, what is the probability that she will be in the 
administrative grade? 

(c) If the worker selected is male, what is the probability that he will be in the 
administrative grade? 

(d) If the worker selected is in the administrative grade, what is the probability of 
the worker being male? 


Solution 

14 

(a) P(admin. grade) = — = 0.35 

(b) We can work this out as ^ = 0.267 (to 3 d.p.) since of the 30 female 
workers only eight are in the administrative grade. 

However we can also work this out more formally using Bayes’ theorem. 


Via Bayes’ theorem 

P fadmin. grade AND female) 
(admin, gradelfemale) - p(female) 


where: 


P(admin. grade AND 


8 _ r\ n 

female) = — - u -^ 


30 _ o yc 
P(female) = ^ 

r lQ t _ Q-L = 0.267 (to 3 d.p.) 

So P(admin. gradelfemale) 0 75 

_6_ _ o 6 since of the ten male workers six are in 

(c) We can work this out as ±Q - 

the administrative grade. 
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P(a dmin. grade AND male) 
P(admin. gradelmale) = p(male) 


where: 


P(admin. grade AND male) 4Q 0,15 

1° n ok 
P(male) = — - °- 25 


0.15 


So P(admin. gradelmale) - 


= 0.6 


, 4U ' oc 6 - 0 429 (to 3 d.p.) since of the 14 
(d) We can work this out as — - v LU ° H ' 

administrative grade workers six are male. 


Via Bayes’ theorem 

P(maleladmin. grade) = 


P(male AND admin, grade) 

P(admin. grade) 


where 


P(male AND admin, grade) = — = 0.15 

14 

P(admin. grade) = — = 0.35 


So P(maleladmin. grade) = 


0.15 

0.35 


= 0.429 (to 3 d.p.) 


5.6 Decision trees 


It is often helpful to visualise a problem involving probability on a decisi® 
tree. When we can clearly see all the possibilities in a given situation it 1111 | 
it much easier to calculate the relevant probabilities. Although decision t ^ 
can be used in any situation, they are particularly helpful when solvinS 1 
lems involving conditional probability. 


WORKED 
EXAMPLE 5.5 


A bag contains five blue marbles, three yellow marbles and two white marbles- 
Two marbles are then drawn from the bag, the first marble not being repla ce 
before the second marble is drawn. 

What is the probability that: 

(a) both marbles are the same colour 

(b) both marbles are a different colour 

(c) at least one marble is blue? 
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figure 5.3 

Using a decision tree 


The decision trpp <c- 

problem. (F ' gure 5-3) helps us see all the 

he possibilities involved in this 



Notice that to find the probability of an event involving any two outcomes occurring 
(e.g. blue marble AND blue marble, BB) we multiply the probabilities. 

So P(BB) = P(B) x P(BIB) 

The probability of two blues is the probability of the first marble being blue x the 
probability of the second marble being blue given that the first is blue: 

5 4 20 2 

i-e. P(BB) = — x Q ~ 90 = 9 

However when we calculate solutions to (a), (b) or (c) below which involve either 
one OR more events occurring, we add the probabilities. 


lution 

Both marbles having the same colour can be achieved in three different ways, 
either two blues OR two yellows OR two whites. 


i.e. 


iytv ~ -- 

P.both marbles same colour, = P«BB, + P<VV, + P ( WW, 
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110 quantitative methods f colour will have the probability of i 

,b) Both marbles having a ^ o|our 


Wfli/s 


the 


(c) The probability of a, least one marble being blue can be achieved i„ five 
possible ways: 


P(BB) OR P(BY) OR P(BW) OR P(YB) OR P(WB) 



20 + 15 + 10 + 15 + 10 70 7 

i.e. P(at least one marble blue) = — - rr = - 


Clearly the decision tree is a useful visual aid, helping us see more clearly how 
we can apply the various probability rules already learned, especially in situations 
involving conditional probability. As we shall see in the next section, the decision tree 
is also helpful in applying the idea of expected value to business decision making 


SELF-CHECK 

QUESTIONS 


5.9 A box of 20 components contains six defective components. If three 
components are drawn consecutively from the box, what is the probability 
of all three components being good (non-defective): 

(a) if there is replacement 

(b) if there is no replacement 

Use a decision tree to help solve the next problem. 


5.10 Two cards are dealt out to a player from a full 
what is the probability of: 

(a) both being a diamond 

(b) one of them being a diamond 

(c) neither of them being a diamond? 

Note-. Answers can be found on p. 339. 


pack (without replacement). 


DID YOU KNOW? 

The expected value nf k 
lead content in petrol by 1 graml ^ 
has been estimated at over $i hT ' n 1 
calculation involved estimating J '° n Per annL 
health and economic benefits o'* Pmbabll,t 'es 
ahull blood pressure, raised cnZ^ 
economy, etc.) and then p, acini S ' Qs ' gre 

each of these benefits. ® 3 moneta ry vai 


5.7 Expected value 

The expected value (EV) of a particular 
event which has different possible ° 11 * 
comes is a weighted average of the value 
(pay-offs) associated with each pos$ib e 
outcome. The probabilities of each out- 
c °me are used as the weights and the*, 
are multiplied by the respective pay-°h s 

each outcome. 
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for 0xamr*i 

having pay-offs X x and X vv't^v! 11 w ^^ c h there are two possible outcomes 

respectively, then the exnectPH \ & Payabilities of each outcome p x and p 2 

ue ex Pected value EV(X) is- r 

EV(X) = p xXx + PiXi 


For example, if there is a 60 

cent chance of earning fs nmT Cent chance of earnin 8 £1,000 and a 40 per 

ng tb ' 000 from an investment, then: 


i.e. 


cIvn 0,60 ( £1 ' 00 °) + 0.40(£5,000) 
E(X) - £600 + £2,000 = £2,600 


(EV1 of a m 1as ^ n § the short-hand notation of Chapter 2, the expected value 
r icu ar course of action over n possible outcomes pan be defined as: 


Formulae for expected value 

EV = ip, • X,. 

1=1 

where p, = probability of /th outcome 
X, = value of /th outcome 

n 

and £p f = 1 

/= 1 


This last condition merely states that we have covered all possible outcomes, so 
the sum of their probabilities must be 1. 


WORKED 
EXAMPLE 5.6 


A fair coin is tossed and, for a £1 stake, the gambler is promised £1.80 if the 
result is a head, but only £0.10 if the result is a tail. What is the expected value 

to the gambler of each throw? 


Let Pi = the probability of 
p 2 = the probability of 
Xj. = the ‘pay- off ’ of a 
X 2 = the ‘pay- off ’ of a 


head (0.5) 
tail (0.5) 
head (£1.80) 
tail (£0.10) 


Solution 



EV = Ml + P2* 2 , 0 5 

EV = 0.5 x £1-80 + 0 5 

EV = £0-9° + £0 ° 5 


x £0-10 


i.e. 
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The gambler can therefore ‘expect’ to lose 5 pence on each throw of the di CP 
though he/she might be lucky and get more heads than tails on the early 
of the dice, and thereby win by ‘quitting while they are ahead’. ° Ws 

On the other hand, the individual offering the gamble can ‘expect’ to mak 
profit (revenue - cost) of £0.05 on each throw (£1.00 - £0.95). 6 a 

Clearly the greater the number of trials , e.g. throws of the dice in our 
example, the more likely it will be that the theoretical probabilities (see p. 99) 
apply rather than the experimental probabilities (see p. 99) of a ‘lucky’ or ' 
‘unlucky’ run. 


Expected value and decision trees 

We can relate this idea of expected value (EV) to the decision tree analysis 
already presented (p. 108). As we have seen, a decision tree is a diagram which 
can usefully represent a series of choices and their possible outcomes. Where 
the outcome is pure chance, then the overall probability of 1 will be divided 
equally between all the possible outcomes (e.g. 0.5 for each of two pure chance 
outcomes). Where the outcome is uncertain but where some outcomes are more 
likely than others, then the decision maker may assign probabilities to each 
possible outcome. The sum of these probabilities over all possible outcomes 
must, of course, be equal to 1. 

The sequence of decisions and outcomes is represented graphically as the 
branches of the 'tree' (see Figure 5.4). At every point (node) that a decision 
must be made or an outcome must occur, the tree branches out further until all 
the possible outcomes have been displayed. 


figure 5.4 

Decision tree 
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► Boxes (decision nodes) are usually used to indicate situations where 
the decision maker consciously selects a particular course of action 
(strategy) and where the outcome of that action is 'certain'. Branches 
coming out of these boxes simply indicate the alternative decisions or 
strategies which might be taken, each of which has a probability of 1 
once selected. 

► Circles (chance nodes) are usually used to indicate situations reflecting 
a 'state of nature', i.e. situations whose outcomes are not entirely under 
the conscious control of the decision maker. Branches coming out of 
such circles show the various possibilities which might occur, together 
with estimates of their probability of occurrence. 

► Pay-offs are the valuations placed at the end of particular branches 
emanating from chance nodes. They are the values which management 
allocates to that event or outcome should it actually occur, and are often 
denoted by a straight (vertical) line at the end of the branch. 

In Figure 5.4, at the decision node box, the decision maker must decide between 
a strategy of increasing price or keeping price unchanged. If this latter strategy 
is pursued, only one profit outcome is possible which is evaluated at £200,000. 
If the former strategy (increasing price) is pursued, then the decision maker 
intends to support the policy change by an active advertising campaign. How¬ 
ever the outcome of such an advertising campaign is not entirely under the 
control of the decision maker. He estimates that there is a 60 per cent (0.60) 
chance of the campaign being a success and a 40 per cent (0.40) chance of it 
being a failure. These two branches are therefore shown as emanating from a 
chance node indicated by a circle. The firm estimates profits of £800,000 should 
the advertising campaign (allied to a price increase) be successful, but losses of 

£600,000 should it fail. 


Backward induction 

^ , thp firm should choose in order to maximise the expected profit 

Which branch ined Th e process of solving this problem is called backward 
can easily be det begin at the right-hand side of the decision tree, 

induction. This r ^ q are located. The first step is to calculate the expected 

where the profit figu the chan ce node immediately to the left of 

profit when the firm is situaie 

these pay-off figures. orobab ility that the branch culminating in a profit 

Because there is a o. y ^ probability that the branch culminating in a 
of £800,000 will occur, an • lue (EV ) of profit when situated at 

loss of £ 600,000 will occur, the expec 

this chance node is: 

EV (profit) = 0.60 (£800,000) + 0.40 (-£600,000) = £240,000 

hplow the chance node in question to show that this is 
This number is a t that node, 

the expected profit when 
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Moving further to the left along the decision tree, it is clear that the firm 
has a choice of two branches, one of which leads to an expected profit of 
£240,000, the other of which leads to an expected profit of £200,000. If the firm 
wants to maximise expected profit, it should choose the former branch. In 

other words, it should increase its price and accompany this with an advertising 
campaign. ° 


WORKED 
EXAMPLE 5.7 


Firm A faces a choice between two policy options. One is to make a certain 
a bank' t? 1 ° f £100,000 from '"Merest on a sum of money left on deposit in 

development ^perfume)' 0 ^ ** S8me ^ ° f m ° ney in " 6W Pr ° duct 

a mljo F r r c m ompettor UC Rrm ,h B e ’’'f"™' iS 3 ? ° PCT Cent P r °t>ability that 
30 Per cent probabilhy^hat TJZZ? ‘ Sim " ar ProdU « a " d a 

a high 0 phce' t ,HP)':' d a e ^"1°^'^ ^T"" 6 Pr ° dUCt ’ ^ A Ca " Choose 

product. P 6 (MP) or a low Price (LP) for its new perfume 

^ How much prom Firm A est , ma(es , hat , t wji| depends ^ ^ ^ 

s. zzim a sinfc — 

(iii) What counter strategy o^price^irrrfB selects. 

' rm A calcu| ates the following table to 
Competition - 


Firm A's 
price strategy 


Firm B’s 
Price response 


Probability Fim A 



’ s profit 


150,000 

100,000 
00,000 
120,000 
100,000 
50,000 
25 0,000 
150 ,000 
1 ° 0,000 
500.000 
3 00 ,OOo 
200,ooo 


Solution 


figure 5.5 presents the decision tree for this 


Problem. 
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figure 5.5 

pecision tree: 
perfume problem 



£ 

150,000 

100,000 

90,000 

120,000 


60,000 

250,000 


100,000 

600,000 


200,000 


Using the technique of backward induction we begin our solutions at the right- 
hand side and work leftwards, back to the initial decision node (Box 4). However 
we can see that there are two 'earlier 1 decision nodes (Boxes 1 and 2) and four 
chance nodes (Circles HP, MP, LP and (3). The pay-offs are listed at the end of 

the branches farthest to the ngh between a high price (HP), medium 

At decision box 1 Firmi A m depend ^ ^ ex 

price (MP) and low price <£> h "e node) of each pricing strategy, which 

r^rnd'eTend^"oTthe probabilities of the price responses of Firm B and the 

. « 


EV = £p,X, 

/= 1 
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with all values in £OOOs we can say: 

EV(Firm A, HP strategy) = 0.6(150) + 0.3(100) + 0.1(90) 

= 90 + 30 + 9 


i.e. EV (HP) = £129(0001 


EV (Firm A, MP strategy) = 0.3(120) + 0.5(100) + 0.2(60) 

= 36 + 50 + 12 

i-e. EV (MP) = £98(0001 

EV (Firm A, LP strategy) = 0.0(250) + 0.4(150) + 0.6(100) 

= 0 + 60 + 60 

'• e - EV (LP) = £120(000) 

At decision node 1, Firm A will select the strategy with the highest expected nrnfit 

pay-off, i.e. a higher price (HP) strategy (in the event of competition from Firm m 
yielding £129,000 in expected profit. K 

At decision node 2, Firm A has no probabilities to contend with since the pav- 
o s listed are not associated with the possible reactions of Firm B. Here it can 
select the higher price (HP) strategy (in the event of no competition from Firm B) 
yielding £600,000 in expected profit. 

However there are probabilities associated with whether or not Firm B will 

woTlT" t Pr ° dUCt ° f ^ 0Wn ' Th6ref0re We mUSt take these int0 account in 
working out the expected value at chance node 3. 

EV (perfume product) = 0.7(129) + 0.3(600) 

= 90.3 + 180 

'- e - EV (perf ume product) = £270.3f00m = £ 270.800 

At the initial decision box (4) we can see that, based on our expectations, there 
will be greater profit from investing the available money in the new perfume 
product than from leaving the money earning interest in the bank. 


An approach such as that in Worked Example 5.7 is often said to be using 
different scenarios, i.e. the evaluation of different possible outcomes for various 
policy initiatives. Of course if the probabilities were to change, for any reasons, 
or the expected pay-offs were to change, then any initial decision must be 
re-evaluated in the light of these new circumstances. 

There is also the issue of the risk associated with any decision. For example 
the £100,000 earned via bank interest is likely to be much less risky than the 
higher expected value of £270,300 earned via new product development. We 
return to the ideas of investment appraisal and risk analysis in Chapter 9, p . 208. 


SELF-CHECK 

QUESTIONS 


5.11 A machine produces 35 defective Items per 2,000 items produced. 

(a) What la the probability of an Item selected at random being defect!, 

(b) If 4,500 Items are produced per day, how many defect ems won 
you expect? 

5.12 A company is considering Introducing a new computer syMedium or farg« 
northern branch offices. The company can lease a s 








computer system. probability ll 

expand over the n„ ? esti mates that if «. 

£100,000 £isn or, year the extra profits , 6 econon, Y continues to 
down the exi ’ 00 and «00,000 re °~\ * 6acl ’ syst »" be 

£20,000 respect^™ ,dS ® a " era, ed wi " be £60*000 MoVoTT S '° WS 

'“emmend one 'l‘ h,S Prob,a "' « L the T * deC ' S,on trea 

* °' * he eomputer system, eXPeC,ed va,ue approach to 
5.13 Clover design h* 

test marketing blt^Tc''T* * " ew p,0<lu ct and are co h 

abandoned (at any s , “f* °' £6S ' 00 ° * abandon,4"^ eWba ' 

mat; 1 ;: r:r*• zzzrz t2 :r hm * “ 

development ^ ^“•££^*—■ 

on the ^response ,S eat,mat ed tha, the e«t "IT"* °’ * he Pro i ec * * by 

*250,000 on the^th^'h'™* " * here ls no 'espons^th™* 611 depend 

:r y r 
„ t - a :rs 

«ote: Answers can be found Pany - 

De found on p. 34o. 


• 5 '- 8 .. P . e ? Ut . a !!°. ns .. a " d .combinations 

. . 


Factorial notation 

you enter 6 and press this key you obtain^T haVe * key marked x!, anTlf 
Use your calculator to show that 7! = 5,040 
Note - 1! = 1 and 0! = l. 


Permutations 


Suppose you are given ten desirable 
Place 4 of these in order of merit, i 
different ways can this be done? ' 


'-J 14(411 l 


•e. first, second, third, 4 ^ 1,7 h 

’ h «w man) 
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, nf the four qualities selected does matter here. We 1r 
Clearly the or en g ss]b i e permutations. The first place can in f ac , 

h ere h° re n Tn ten deferent waysf having chosen the fits, place, there ate n £ 
be chosen m te y d place can be chosen in nine differ, 

possible places left, can * c U n in eight different ways and lh ™ 

fourth Jaceln'seven different ways. The total number of ways is 10 x 9 x 8 « 7 

= 5,040. . , „ 

This may be written using factorial notation 


10x9x8x7 = 


10x9x8x7x6 x5/4x3x2xl _ _HT _ . 


Formula for permutations 

In general, if there are n items and r are to be placed in order, the numbet of 
different ways (permutations) in which this can be done is. 

P = ——— 

" ' (n - r)! 


The example above would be written 


10 


r, i.e. —22!— = 22!. 5.040 

(10 - 4)! 6! 


In other words 5,040 different permutations are possible when choosing four 
out of ten items and then ordering them accordingly. 


Combinations 

Here we are not interested in the ordering of the items selected 

Consider again the example used for permutation-; s 
select four desirable qualities of a new product out of th t UPP ° Se We wisb t0 
given ( irrespective of the order of merit). The number of wa^ ^ desirab,e qualities 
to the number of permutations, 5,040, divided by the nu °h d ° lng tbis is e< ? ua) 
ing the four selected items in order of merit. In fact we 01 ^ ° f Wa ^ s °* P* ac ' 
selected items in 4 x 3 x 2 x 1 = 4! = 24 different ways^h 3rrange tbe four 
divided by 24 is 210, and this is called the number of combina feSU,t of 5 > ()4 ° 

Using factorial notation, the number of combination c ™ atlons - 

oons is equal to- 

101 = 10x9x8x7x6x5x4x3x2x1 = 3.628.800 = 2 , ft 

6! x 4! (6 x 5 x 4 x 3 x 2 x 1) x (4 x 3 x 2 x 1) 720 x 24 J 


Formula for combinations 

In general, if there are n items and r are to be selected irrespective of 
number of ways (combinations) in which this can be done is: 


order 


the 


nC r = 


n\ 


(n - r)\ x r! 
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It is useful to find „C r for r = 0, 1, 2 3 
r = 0: „C„ = " ! 

r = 1: „C 1 = 


n! x 0! 
n\ 


= 1 


(n - 1)! x 1! 


= n 


r = 2: „C 2 =---= n{n - 1 ) 

(n - 2)! x 1! 2x1 


r = 3: „C 3 = 


n\ = n(n -1)(« - 2) 
(« — 3)! x 3! ~ 3x2x1 


The values 


l n n ( n ~ ^ n ( n ~ l)( n ~ 2) n ( n ~ l)(w - 2 )(m - 3) 

' U ' 2x1 ' 3x2x1 ' 4 x 3 x 2 x 1 

are called binomial coefficients. We need these in the next chapter when we 
consider Binomial and Poisson probability distributions. 


SELF-CHECK 

QUESTIONS 


Review 

Questions 


5.14 How many ways can we select three elements from a set of eight 
elements: 

(a) where the selected elements are ranked in order? 

(b) where the selected elements are not ranked in order? 


L5 A batch of 20 video recorders contains four defectives. A sample of five 
videos is to be selected from the batch. 

$ n. st-et^ 

ways can this occur? 

16 Five students (A, B, C, D and E) have been elected to the student 

. . |f three students are to be selected for a committee, how many 

(a) If thre are possible? 

different committees a n chairperson, treasurer and 

(b) " different 1,06 ups ,rom ,l,e flve studen,s a,e 
possible? 

found on p- 34:1 - 

tote: Answers can 

. l uue city, one home in every 50 

51 An insurance ^ ““ 


other 

even? 
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5 2 The probability that a record shop will sell at most eight copies of a particu¬ 
lar CD each day is 0.60 and the probability that it will sell from nine to 12 
copies of the CD each day is 0.35. What is the probability of: 

(a) at least nine sales; 

(b) at most 12 sales; 

(c) more than 12 sales? 

5.3 A sample of four items is selected from a batch that contains eight defective 
items and 24 without defects. What is the probability that: 

(a) all four items selected will be defective (assume replacement)- 

(b) all four items are not defective (again assume replacement)? ' 

(c) How would your answers to (a) and (b) alter if there was no 

replacement ? 10 

ata univer %. 

probability , hat , fie SMe * m w|]J tadent ls s *«ed at random what is the 

( a ) be male; 

(b) be female■ 

"*** “ *"*• find Probability that she is not studying 
5,5 ^ ec ords show that #„ 

! m - to* « defeat r items r^uced in a facto a 

? ur are defective. If w, d f 6Very 100 Produced d 7 durin g the dav 

durin g the day and 800 d,^ 3 ^ hour P eri °d 2 ^ nigllt shift 

a ms produced in tfie , ' °° produced during the 24 ^ fhat an 

(0 is in «* "Sohiftt;; 5 **** ° Ur PWi0d ' 

( v fa ^Z7 !;: ofthe ^ asdefective; 

<4 ZZ m? '*“* iS the probabil 'ty that - t 

. . service n 

.. 7 ^ 

~~ r «aije r 1,1 7, 

2b 24 
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(a) 

(b) 

(c) 


If 

retailer givesVood^sewice? 111010111 ' What iS the pr ° bability that the 

one selected • e ^ e ^ teb at ran dom, what is the probability that the 

Wha t is th. ^ d K ePendent and gives good se rvice? 

chain will pr ° bablllty that a retailer that is part of a high street 
cnain will give poor service? 


. . re P resen ^ a bve finds that the probability of making a sale on the 

visi a new client is 0.5. On the second visit the probability of 
ma mg a sale is 0.6 if a sale was made on the first visit and 0.4 if no sale 
was made on the first visit. Use a tree diagram to work out the probability 
of just one sale resulting from the two visits. 


5.8 An employer is attempting to estimate the proportion of employees who 
drink alcohol regularly during their lunch break. In an attempt to ensure 
honest answers the employer asks each employee to secretly toss a coin. 

If a head is obtained then the employee answers the question, 'were 
you born in an odd numbered year?' 

If a tail is obtained then the employee answers the question, 'do you 
drink regularly at lunch time?' 

The percentage of employees answering yes to either question was 
37 per cent. Use a tree diagram to estimate the percentage of regular lunch 
time drinkers. 

(N.B. let the probability of a lunch time drinker be p ) 


s 9 A company is contracted to finish a £100,000 project by 31 December. If 
it does not complete on time a penalty of £8,000 per month or part of a 
month is incurred. The company estimates that if it continues alone t ere 
l a 40 ner cent chance of completing on time and that the project 
is on y P eve n four months late with equal probability, 

might be one, two te ^ ^ ^ firm at a CQSt of £ i 8 ,000. If the 

Subcontract ^ the pro bability that the company completes 

subcontrartorsar^ ^ ^ prQject is still late it will only now be one or two 

month late with equal the prob i e m of whether the company 

Draw a deci sionjree to J? subcontractors and ana i yse using the ex¬ 
pected value technique. 

• first prize by selecting the correct six number 

1 to 4 

afc has buttons numbered ",is 

:r„ r : -— wm open 

the probability that som 
the safe? 
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5.12 Liker Airways has 15 video programmes available to show during its tram 
atlantic flights. If six videos are to be shown during the flights: 

(a) If the order is unimportant, how many different combinations are 
there? 

(b) If 900 of the combinations of six programmes are thought to be 
incompatible, what is the probability that a selection of six taken at 
random will be compatible? 

5.13 A textile designer has chosen to use eight different colours in this year's 
fabric designs. Production constraints mean that only five different colours 
can be used in each design. 

How many different colour groupings can be advertised? (Assume all 
fabrics have five different colours.) 

Answers to Review questions are to be found on pp. 415-419 at the 
back of the book. 


Further study and data 
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Addison Wesley Longman, chapter 4. 

Morris, C. (1999), Quantitative approaches in business studies, 5th edn, Pitman, 
chapter 8. 

Oakshott, L. (1996), Essential elements of business statistics, BPP, chapter 4. 

Swift, L. (2001), Quantitative methods for business, management and finance, 
Palgrave Publishers, parts P1-P4. 

Thomas, R. (1997), Quantitative methods for business studies, Prentice Hall, 
chapter 2. 

Waters, D. (1997), Quantitative methods for business, 2nd edn, Addison Wesley 
Longman, chapter 13. 

Wisniewski, M. with Stead, R. (1996), Foundation quantitative methods for 
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Sources of information and data 

See the list the end of Chapter I (p. 25). 


On-line exercises 


Check the web site www.booksltes.net/buiton to 

and on-line exercises tor this ehaptei. 


find extra 


questions, Spreadsheet 




CHAPTER 6 


Probability distributions 


Objectives 

When you have read this chapter you 

should be able to: 

»> understand the main properties of a 
Normal distribution; 

► work out the probabilities of particular 
events occurring when these follow a 
Normal distribution; 

► know that a ‘confidence’ interval is a 
range of values within which a particular 

value might lie; 

► use and interpret the probability (Z) 
tables associated with the Normal 
distribution; 

► be familiar with the Binomial and 
Poisson distributions, and be able to 
work out the probabilities of events 
occurring when these follow such 
distributions; 

► be able to identify when and how to 


Introduction 

Chapter 5 looked a. the idea of P^b^s a "d th«r 
application to business situations. Qne 

of°the inost Important^robability (MribuUom. you will 

abilities We considered some of the proper 1 
" distribution in Chapter 2. Here we: note how 
these properties can be used to develop the so called Z 
tables from which we can derive estimates of probabil- 
tv The Binomial and Poisson probability drstr.but.om 
7 e Ilso introduced and related to the work on probab.l- 
it y in Chapter 5. Other probability distributions, such as 
the Student-t and Chi-squared distributions, are c 

S ' deI Answers to^Self-check questions', responses to each 
'Pause for thought' and answers to the 'Review questions 
can be found at the end of the book. 


the Normal approximation to the 
Binomial and Poisson distributions. 


6.1 Normal distribution 


on distriDuuoiio- . , , 

„ , AM that the Normal distribution is perfect v 
We noted in Chapter 2 (P- JW * measure5 of central location, namely 

symmetrical ('bell' shaped), with a same ya|ue for some variable A„ 

arithmetic mean, median and modC' b g Greek letter „ ,'mew'), then we know 
« - call this arithmetic mean val ^ SO^t ^ 

I," r side of* ^ Nonna, curve expressed as 1 and with 

the whole ^strlbution then lying el,her , ^.“button around /. will allow us to 
°' 5 fdee'd the sy-nmetry of any NonM^uUor ^ x around 

calculate p 

We can do this y 23 
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figure 6.1 

Areas and 
probabilities under 
the Normal 
distribution 



This measure of dispersion (see Chapter 2) can then be used to calculate prob¬ 
abilities for any Normal distribution. For example, as shown in Figure 6.1, we 
know that 68.26 per cent of all the observations will lie within one standard 
deviation of the mean; 95.44 per cent of the observations will lie within two 
standard deviations of the mean; and 99.74 per cent of the observations will lie 
within three standard deviations of the mean; etc. 


PAUSE FOR THOUGHT 


_ Can you suggest three different 

lormal or close to Normal? 


types of distribution you might expect to be 


Confidence interval 

We use the term confidence Intervals to refer to those ranges of values from 
the arithmetic mean (p) expressed in terms of the standard deviation (cr), within 
which we can expect a particular value of the variable X, to lie. We return to 
consider confidence intervals further in Chapter 7 (p. 152). 


6.2 Standard normal distribution 


The calculation of probabilities under any Normal distribution can be taken a 
step further by transforming the values of the variable X, into the numbeis of 
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figure 6.2 

The standard Normal 
(Z) distribution: 
mean 0, standard 
deviation 1 



standard deviations (o) from the mean (ft). The resulting distribution is often 
re erre to as the standard Normal distribution and the transformation 
which brings it about is often called the Z score or Z statistic: 


2 _ x i ~ tL _ Value of variable - Arithmetic mea n 
o Standard deviation 

In other words the Z statistic simply calculates the number of standard deviations 
from the arithmetic mean. If the distribution of the variable X, is Normal, then 
the distribution of the Z statistic will also be Normal, as shown in Figure 6.2. 

A table giving the area shaded in Figure 6.2 for any value of the Z statistic 
is presented in full in Appendix 1 (p. 314). For example, for a Z score of +2.0 we 
will have 0.0228 (2.28 per cent) of the distribution to the right of that Z score. 

Of course, since the Z statistic is normally distributed, a Z score of -2.0 will 
give us an equivalent area, but this time to the left of that Z score. 

In other words, if the particular value for X, is two standard deviations 
above (or below) the mean for a normally distributed variable, then we can 
expect 0.0228 (2.28 per cent) of the distribution to be above (or below) that 

value. 


If z = +, then area in right-hand tail 
If 2 = —, then area in left-hand tail 


t +» ctindard Normal (Z) distribution is a Z score of 0. 

Note that the mean of the^standard^N ^ ^ ^ ^ w<? „ 2er0 

In other words w mean> so the Z score is 0. It therefore follows 

standard deviations () of the distribution will lie to the right, and 0.5 

that for Z = 0, 0.5 (50 per / 

<5 ° To' help 'you ^become fan, mar with using the full * table („. 314), a small 
section of that table is shown below: 
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WORKED 
EXAMPLE 6.1 


figure 6.3 


Z 0.00 O.Ol 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 

1.0 0.1587 0.1562 0.1539 0.1515 0.1492 0.1469 0.1446 0.1423 0.1401 0.1379 

1 1 0 1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 0.1170 

1 2 0 1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985 

1 3 0 0968 0 0951 0.0934 0.0918 0.0901 0.0885 0.0869 0.0853 0.0838 0.0823 

1.4 0.0808 0.0793 0.0778 0.0764 0.0749 0.0735 0.0721 0.0708 0.0694 0.0681 


For example, if the Z score is found to be +1.15, then go down the Z column to 
the 1.1 row and across the 1.1 row to the position underneath the 0.05 column. 
We read off the value 0.1251, which tells us that 0.1251 or 12.51 per cent of 
the distribution is in the right-hand tail. Of course had the Z score been —1.15, 
then 0.1251 or 12.51 per cent of the distribution would have been in the left- 
hand tail. 

Check that for Z = +1.38 you would read 0.0838 (8.38 per cent) in the right- 
hand tail, and for Z = -1.42 you would read 0.0778 (7.78 per cent) in the left- 
hand tail. 


Suppose a variable X, has a Normal distribution with a mean of 100 and a 
standard deviation of 10. Suppose that we want to find the probability of X, being: 

(a) 115 or more 

(b) 75 or less 

(c) between 75 and 115 

Solution 

We can use Figure 6.3 to help solve this problem. Note that we show both the 
variable X, and its associated Z scores, Z, on the same diagram. 
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worked 
example 6.2 


(a) For X 1 = 115 , z , = _ H5 - 100 

O' 15-= +1 ' 5 

From the Z tables (p. 314 ), we fj nd that z _ +1 5 gi 0.0668 as the area 
in the right-hand tail. 

i-e. Zi = + 1.5 
Px = 0.0668 
Px = 6 . 68 % 

There is a 6.68 per cent chance of X, being 115 or more. 

(b) For X 2 = 75, Z 2 = - 2 ~ £ = 75 ~ 100 = _ 2 5 

o 10 

From the Z tables (p. 314), we find that Z 2 = -2.5 gives 0.0062 as the area 
in the left-hand tail. 

i-e. Z 2 = -2.5 
P 2 = 0.0062 
P 2 = 0.62% 

There is a 0.62 per cent (i.e. less than 1 per cent chance) of X, being 75 or 
less. 

(c) The whole area beneath the Z curve is 1.00, so the area between X 2 = 75 
and X 1 = 115 is 1.00 - (0.0062 + 0.0668) = 1 - 0.0730 = 0.9270. 

In other words there is a 92.7 per cent chance of X, being between 75 
and 115. 

In solving problems of this kind it is always helpful to draw a sketch 
diagram to indicate what has been given in the question and what needs to 
be found, as in Figure 6.3 above. Sometimes we will need to make use of the 
fact that the area to each side of the mean under the normal curve is 0.50 
(50 per cent). You will see in Worked Example 6.2 that sometimes we must 
subtract the area in the respective tails of the distribution from 0.50 in order 
to obtain the area required. Note also that whenever we seek to find an area 
which extends across the mean (/j), then it is usually helpful to solve for each 
side of the mean separately before combining the result. 


The attendance at a night club is thought to be normally distributed, with a mean 
of 80 persons and a standard deviation of 12 persons. What is the probability 
that on any given night the attendance is: 

(a) 74 persons or more 

(b) 83 persons or less 

(c) between 70 and 85 persons? 

Solution 

Figure 6.4 represents the areas we require in each case. 
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7480 


Zx 0 

(a) X, > 74 persons 


8083 

I I 


oz ± 

(b) X ( < 83 persons 


70 80 85 

- 1 - 1 —'-Z 

Z 2 0 Z ± 

(c) 70 < X, < 85 persons 


figure 6.4 

The areas required 
are shaded (A 2 and 
A 2 ): the areas given 
in the Z tables are 
unshaded {P ± and 
P 2 ) 


Remember that when we calculate the Z score, the Z tables (p. 314) give us the 
area in the tails of the distribution. 

(a) To find X, > 74 means we require the shaded areas A ± and A, in Figure 6.4(a). 

For X ± = 74, Zi - - 1 ~ = — - 80 = -0.5 

o 12 

Z t = -0.5 
P 1 = 0.3085 

A ± = 0.5000 - 0.3085 = 0.1915 
4 2 = 0.5000 
+ 4 2 = 0.6915 

There is a 0.6915 (69.15 per cent) probability that the attendance will be 74 
persons or more. 

(b) To find X) < 83 means we require the shaded areas A x and A 2 in Figure 6.4(b). 

X t - n 83 - 80 


For X 1 = 83, Z x = 


12 


= +0.25 


Z 1 = +0.25 
P 1 = 0.4013 

A x = 0.5000 - 0.4013 = 0.0987 
A 2 = 0.5000 
A, + A 2 = 0.5987 

There is a 0.5987 (59.87 per cent) probability that the attendance will be 83 
persons or less. 

(c) To find 70 < X, < 85 persons we required the shaded areas 4, and 4 in 
Figure 6.4(c). 2 

For x — 85 7 _ ~ P — 80 +5 

0r ~ 85, Zl - = -— = — = +0.42 (rounded) 


a 12 

Zj = +0.42 
P, = 0.3372 

4j = 0.5000 - 0.3372 = 0.1628 
4] = 0.1628 


12 
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WORKED 
EXAMPLE 6.3 


f 'gure 6.5 

Probability known, 
Va,Ue of variable 
unknown 


For X 2 = 70, Z, = X2 ~ E - 80 -10 

2 CT = ~~12~ = U = ~ 0 - 83 
Z 2 = -0.83 
P 2 = 0.2033 
A 2 = 0.5000 - 0.2033 
A 2 = 0.2967 

X + Ag = 0.4595 

There is a 0.4595 (45.95 per cent) probability that the attendance will be 
between 70 and 85 persons. 


Finding the variable value when probability known 

So far we have known the specific values of the variable X, and found the 
probabilities with which these values might occur. Here we reverse the process. 
Suppose we know the probabilities with which specific values of the variable X t 
might occur, how can we then find these values of X,? Worked Example 6.3 
shows how we go about solving such problems. 


The life of a machine component is normally distributed with mean 60 hours and 
standard deviation 30 minutes. If 2.5 per cent (0.025) of components last more 
than X 1 hours, find the value of X- 

Solution 

In a question like this, do make sure that you express all values in the same unit. 
here hours or minutes. If we use hours, then the standard deviation is 0.5 hours. 

as shown in Figure 6.5. 
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P x = 0.025 

We can therefore look in the body of the Z tables (p. 314), to find the Z value (Z x ) 
which would give us the probability in the right-hand tail of 0.025. 

Z x = +1.96 

We know that 



<7 


0.5 

+1.96(0.5) + 60 = \ 

60,98 = X, 

In other words we would expect 2.5 per cent (0.025) of the machine components 
to last more than 60.98 hours. 

Note: if the left-hand tail had been involved then the calculations would have used 
a negative value for Z. You can check this yourself by finding the number of hours 
(X 2 ) for which 2.5 per cent (0.025) of the components last less than X 2 hours. 
Check that your answer is 59.02 hours and that -1.96 was used in your calculation. 


SELF-CHECK 

QUESTIONS 


6.1 Find the area beneath the (Normal) curve over the following values for the 
Z score. Draw diagrams in each case and use the Z tables (p. 314). 

(a) Z x = 0 and Z 2 = 0.2 

(b) Z x = 0.5 and Z 2 = 1.0 

(c) Z x = 0.95 and Z 2 = 2.55 

(d) Z x = 1.55 and Z 2 = (+) infinity 

(e) Z x = -1.5 and Z 2 = 0 

(f) Z x = -1.85 and Z 2 = -1.05 

(g) Z x = -1.55 and Z 2 = (-) infinity 


6.2 Attendants at rock concerts in a stadium are normally distributed with mean 
20,000 persons and standard deviation 4,000 persons. For a future event, 
what is the probability: 

(a) more than 28,000 persons will attend 

(b) less than 14,000 persons will attend 

(c) between 17,000 end 25,000 persons will attend. 


.3 Use the same data as In question 6.2. 

(a) I. you know that them Is a 5 per cent (0.05) probability that mom than 
a certain number of people <XJ will attend, what Is the value of X,? 

(b) If there is a 1 per cent (0.01) probability that less than X 2 people will 

attend, what is the value of X 2 ? 
lote: Answers can be found on pp. 342 3 
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f:l B . i ! 1 . 0mial distribution 

, ' f on ®i a l distribution can be used to describe the likely outcome of 
events for discrete variables which: 

(a) have only two possible outcomes 
and 

(b) are independent. 

Suppose we are conducting a questionnaire. The Binomial distribution might be 
used to analyse the results if the only two responses to a question are 'yes' or 
no and if the response to one question (e.g. 'yes') does not influence the likely 
response to any other question (i.e. 'yes' or 'no'). 


Put rather more formally, the Binomial distribution occurs when there are n 
independent trials (or tests) with the probability of ‘success’ or ‘failure’ in 
each trial (or test) being constant. 

Let p = the probability of 'success'. 

Let q = the probability of 'failure'. 
then q = 1 - p 

For example, if we toss an unbiased coin ten times, we might wish to find the 
probability of getting four heads! Here n = 10, p (head) = 0.5, q (tail) = 0.5 and 

q = 1 ~P 

The probability of obtaining r 'successes' in ' n ' trials (tests) is given by a 
formula which incorporates our work on combinations in Chapter 5, namely: 


P(r) = n C r p r q n ~ r 

where r = 0, 1, 2, 3 ... n 


In Chapter 5 we saw that: 
n! 

n ° r ~ (n - r)\ r\ 

The probability of getting exactly four heads out of ten tosses of an unbiased 
coin, can therefore be solved as: 

P(4) = )0 C 4 0.5 4 0.5 6 

10! 10 x 9 x 8 x 7 _ 21Q 

now ) 0 C 4 - 4^1 4 ! 4 x 3 x 2 x 1 

So P(4) = 210 x (0.5 ) 4 x (0.5)" 

p( 4 ) = 210 x 0.0625 x 0.015625 
P(4) = 0.2051 

In other words the probability of getting exactly four heads out of ten tosses of 
an unbiased coin is 0.2051 or 20.51 per cent. 
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It may be useful to state the formulae for finding all the possible pro 
ities of obtaining r successes in n trials. 

Where P(r) = „C r p r q"~ r 

and r = 0, 1 , 2 , 3, ... n 

then, from our knowledge of combinations (see p. 119): 


P(0) = q n 
P(l) = npQ ' 1 " 1 


P(2) = 


n(n ~ Vq - 2 

2 xr 


P(3) = 


n(n - 1 ){n - 2 ) 3 ^ 
3x2x1 


P(4) = 


njn - l)(n - 2)(n - 3) 4^,4 

4 x 3 x 2 x 1 


P(n - 2) = 


n(n -_ 1 ) „- V 
2x1 


P(n - 1) = nff^q 
P(n) = p" 


PAUSE FOR THOUGHT 6.2 


Can you think of three different types of problem which might be solved 
using the Binomial distribution? 


We can, of course, use this approach to answer questions which ask for the 
probability of an exact number of 'successes' occurring, such as P(4). We can also 
use this approach to answer broader questions such as the probability of ‘less 
than 4' successes occurring, which would be P(0) + P(l) + P(2) + P(3), and so on. 


Cumulative Binomial distribution 

An alternative to such calculations would be to use tables of the cumulative 
Binomial distribution presented in Appendix 2 (p. 315). These tables are 
cumulative in that they give the probabilities of r or more 'successes’. For example, 
if we require the probability of six or more items in a sample of ten being defect¬ 
ive, when /; = 0 . 20 , then from the table we find that P (6 or more) = 0.0064. It 
therefore follows that for the same sample, P(5 or less) = 1 - 0.0064 = 0.9936. 


A firm produces a product and finds that 10 per cent of its output is defective A 
small sample of five items is taken from the production line. Find the probability of 
getting each of the following numbers of defective items in the sample: 0 . 1 , o 3 
4 and 5 defective items respectively. 


WORKED 
EXAMPLE 6.4 








Solution 
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SELF-CHECK 

QUESTIONS 


Here n = 5, p = 0 .1, q = 
calculate the probabilities 


°' 9 ' Us 'ng the formulae listed 
as being: 


above (p. 132), we can 


P(0) = q 5 = (0.9) 5 = 0.5905 


P(l) - npq 4 = 5 x 0.1 x (0.9) 4 = 0.3281 

D/Q\ _ ~ 1) 2 3 5 x4 

2x1 PQ = 2~xl X (0 - 1)2 x (°- 9 ) 3 = 0-0729 


P(3) 

P(4) 

P(5) 


o(n-l)(n-2) „ 5x4x3 

3 x2xl PQ= J ^2xl X (0,1) X (0 ' 9)2 = °- 0081 

np A q = 5 x (0.1) 4 x 0.9 = 0.00045 

P 5 = (0.1) 5 = 0.00001 


So, for example, the probability of exactly two out of our five items being defective 
is 0.0729 or 7.29 per cent. 


6.4 Evaluate the following expression using your knowledge of combinations, 

(a) 3 C 2 (b) 10^3 

( C ) 12^5 ( d ) 14^8 

6.5 If 10 per cent of the items produced by a manufacturer are defective, find 
the probability that for a sample of six items: (a) two items will be 
defective; (b) three items will be defective. 

6.6 The probability that an invoice contains an error is 5 per cent (0.05). If an 
audit of ten invoices is taken, what is the probability of finding less than 
three incorrect invoices. Solve using the cumulative Binomial probabilities 
table (p. 315). 

Note: Answers can be found on pp. 344-345. 


Normal approximation to the Binomial 


Using the formula on p. 131 to calculate the probabilities for given numbers of 
successes to occur is obviously time consuming. It is therefore useful to note 
that if // is large and p > 0.1 then we can solve calculations such as those in 
Worked Fxample 6.4 using the Normal distribution considered at the start of this 
chapter. The Z statistic is calculated as before: 



a 


However when we are usin# 
tion, Binomial-type problems 


tin* Normal distribution to solve, as an approxinio- 
we must express ji anil rr in particular ways. 
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figure 6.6 

Normal 

approximation to the 
Binomial distribution 



30 

Number of ‘successes’ 


For a Binomial distribution 

H = np 



where p = probability of 'success' 
q = probability of 'failure' 
n = number of trials 

Generally speaking, as a 'rule of thumb' we can use the Normal approxima¬ 
tion to the Binomial distribution when np > 5, and p > 0.1 

Continuity correction 

Whereas the Binomial distribution represents discrete data, the Normal distribu¬ 
tion represents continuous data. Strictly speaking, whenever we are approxim¬ 
ating a discrete variable with a continuous one we should use a continuity 

correction. 

The idea behind the continuity correction is illustrated in Figure 6.6. The 
curve, representing the continuous distribution, cuts through the blocks of the 
histogram, representing the discrete distribution. 

Clearly some areas, such as A are included undei the continuous distribution, 
while other areas, such as B, are excluded under the continuous distribution. 
Overall these areas will tend to cancel each other out. However, since each 
block of the discrete (Binomial) distribution represents a whole number, such as 
the number of 'successes', it can be regarded as extending from 0.5 below and 
ahrwp that whole number. 
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In Figure 6.6, to find the probability of between, say, 26 and 30 successes , 
we must find the area under the normal curve from 25.5 to 30.5. 

P(26 to 30) = P(25.5 to 30.5) 

In Figure 6.6, to find the probability of, say, 30 or more 'successes , we must 
find the area under the normal curve to the right of 29.5. 

P(30 or more) = P(29.5 or more) 


Continuity correction 

To adjust discrete data to continuous data, find the probability from 0.5 & 
the lowest whole number (integer) to 0.5 above the highest woe 
(integer). ___ 


WORKED 

example 6.5 


>ast data suggest that for every 100 telephone enquiries at the feception 
lesk of a hotel, only some 20 bookings are made. Find the P roba ^ J bookings . 
mm the next 100 enquiries received, the hotel will receive 30 


t 


Solution 

This Is a Binomial situation as each enquiry leads to only one of two outcomes, 
booking (success) or non-booking (failure). There is an established or fixed 
probabflity (p = 0 . 2 ) of any enquiry leading to a booking, based on past 

experience. > al , = 0 . 2 ), then we 

can on * *. ** w„h 

continuity correction, X 1 will be 29.5 rather than 30. 


p = 0.2 
0 = 0.8 

H = np = 100(0.2) = 20 

c 7 = -sfnpQ = 00(0.2)(0.8) - 4 

X 1 - np _ 29.5 - 20 = +2 38 
A - , 4 

V npq 
Z, = +2.38 

Using the Z tables (p. 314) 


P 1 = 0.0087 

There is a 0.0087 (0.87 per 


cent) probability of the next 100 hotel enquiries 


yielding 30 or more bookings. 
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figure 6.7 

Normal approximation 
to the Binomial 
distribution 



Clearly this is a much quicker method than finding P(30), P(31) • • • ( ^ 

the conventional formula for the Binomial distribution presented P- • 


SELF-CHECK 

QUESTIONS 


6.7 From the previous hotel booking problem calculate the probability that the 
next 100 enquiries will yield: 

(a) 25 or more bookings (b) 16 or more bookings 
(c) less than 14 bookings 

6.8 Suppose that 10 per cent of all houses in a district have burglar alarms. A 
random sample of 121 houses is selected; use the Normal approximation to 
find the probability that: 

(a) Exactly 14 of the houses have alarms? 

(b) Fewer than ten houses have alarms? 

(c) From seven to 11 inclusive of the selected houses have alarms? 

(d) From nine to 13 inclusive of the selected houses have alarms? 

Note: Answers can be found on pp. 345-346. 


6.4 Poisson distribution 

The Poisson distribution may be regarded as a special case of the Binomia 
distribution. As with the Binomial distribution, the Poisson distribution can b 
used where there are only two possible outcomes, 'success' (p) or 'failure' (q) 
ami these events are Independent. The Poisson distribution is usually use. 
where n is very large butis very small, and where the mean np is constant an. 

,he chanc 

typically used for unlikely eyents such as accidents,Twkes.'el’c! 0 " theH ' 0 ' 
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ine Poisson distribution 
to occur at random, such as 
minal, etc. 


is also used to solve problems where events tend 
incoming phone calls, passenger arrivals at a ter- 


for both 'success' (n\ In f 01 sol y in 8 Binomial problems uses the probabilities, 
only uses the probabilities ^ $ ° Mng P ° iSS ° n pr ° blemS 

given by the formula-^ ^ P ° SSlble t0 show that the probability of r successes is 


P(r) = 


e~y 

r! 


where e - exponential constant = 2.7183 
M = mean number of successes = np 
n = number of trials 
p = probability of 'success' 
r = number of successes 

If we substitute r = 0, 1 , 2, 3, 4, 5 ... in this formula we obtain the following 
expressions: 


P(0) = e‘ 


P(l) = pe 


rju 


P(2) = 


2x1 


P(3) = 


/r 3 e p 
3x2x1 


P(4) = 


ire~ 


4 x 3 x 2 x 1 


P(5) = 


p 5 e u 


5x4x3x2xl 


In questions you are either given the mean p or you have to find p from the 
information given, which is usually data for n and p; p is then obtained from 

the relationship p = np. 

You have to be able to work out e raised to a negative power. Remember 
from your work on powers (p. 293) that <■-= is the same as 1 so you can simply 


work this out using — ' 

Alternatively many calculators have a key marked f. The easiest way to 

„ H KavTe 'on vour calculator is to enter 3, press +/- key, press « key, and you 
find (say) e o V calculator does not have an e key but has an 

**««* ■»»**-— - ^ 

should obtain 0.049786. 


worked 
Sample 6.6 


th many employees, accidents occur at random at the 
, a large company wii ^ the probabNit y that four accidents occur on 

lean rate of 3 per day. 

single day. g fau)ty ca \\ on a telephone line and 2,000 

here is a 0.001 probabi ry^ ^ ^ ^ probabj | jt y of there being less than 

alls are made each line in a day? 
uee faulty calls on that line 

hp answered using the Poisson distribution. 

(a) and (b) can be answer 
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s t random and relatively few accidents occur. Although 

(a) Here accidents occur at ran = ■ = 3 jg known . The occurrence of non- 

p (accidents) is unknown p ther efore use the Poisson distribution 

accidents ( q) is irrelevant here, w 
to solve the problem. 


P(r) = 


P(4) = 


r\ 


- 3 3 4 


4 x 3 x 2 x 1 


= 0.0498 x 81 = 0 1681 


P(4) = 0.1681 

|„ other words there is a 0.1681 ( 16.81 per cent) probability of four accidents 
occurring on a single day. 

lb) Here the probability p is very low (p = 0.001) and the number of trials, n is 
"” ve"l 2.000). The mean (p) - np - 2.000 (0.001) = 2 faulty calls 

pe7day. Since it is also true that p = np < 5 the Poisson distribution would 

again seem appropriate. 

To get less than three faulty calls means P(0) + P(l) + P( 2 ) 


6^ Ll r 

Using P(r) = —— 

P -2o° e - 2 i 

We get P(0) = = — = e = °- 1353 

e~ 2 2 1 

P(l) = = 0.2706 

e~ 2 2 2 

P(2) = = 0.2706 

2x1 

So the probability of getting less than three faulty calls is 0.1353 + 0.2706 + 
0.2706 = 0.6765 or 67.65 per cent. 


Cumulative Poisson distribution 

An alternative to such lengthy calculations would be to use tables of tin 
ulativc Poisson distribution presented in Appendix 3 (p. 316). lb eH 
are cumulative in that they give the probabilities of r or more 'successes • 

For example, if we require the probability of less than three taulh *• 
occurring in a single day when they typically occur at random at the nu’a 1 ' ^ 
of two per day on a telephone line (see Worked Example 6.6(b) above), the1 ^ , 
adapt r to be three or more for purposes of the cumulative tables. So for I 1 . 
r = 3 we read off the value 0.3233 from the cumulative tables. The P I ° b y ult y 
of less than three faulty calls is, of course, 1 - probability of three or o ,0,c ( | )C > 

calls, i.e. 1 - 0.3233 = 0.6767. This is close to the probability found by 
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Workpdp 1116 ^^ + P ^ + P ^ for the Poisson distribution in the answer to 
Worked Example 6.6(b) above. 

Normal approximation to the Poisson distribution 

If the Poisson distribution is appropriate except that p (= tip) > 30 (instead of 

p - < 5) then the Normal approximation to the Poisson distribution can 

be used. The Z statistic is calculated as before and the Z tables of Appendix 1 

(p. 314) can be used to find probabilities in the respective tails of this 
distribution. 

Z-Xj-t* 

<7 

where p = np 

(T= <y/«p(l - p) 


■Jnp( 1 - p) 

Note: the continuity correction when adjusting discrete data to continuous 
data (see p. 135) will also apply here. 

The following flow chart will give you some guidance on when to use the Binomial 
or Poisson distributions, or the Normal approximations to these distributions. 

Of course the flow chart only applies to variables which are discrete (take 
on fixed values), which can take one of two possible outcomes ( p,q ) and which 
occur independently. 


figure 6.8 

Selecting the 
distribution to use 
(discrete variable, 
two outcomes, each 
independent) 



Note: + Poisson is used here as an approximation to the Binomial 
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SELF-CHECK 

QUESTIONS 


REVIEW 

QUESTIONS 


figure 6.9 


6.9 Suppose n = 1,000 and p = 0.0002 use the Poisson distribution to find: 

(a) P(X = 0) (b) P(X = 4) 

(c) P(X > 1) (d) P(X 4) 

mean value ji = np = 1,000 x 0.0002 = 0.2 

6.10 The average number of trips per family to amusement parks in the UK is 
Poisson distributed with a mean of 0.7 trips per year. What is the 
probability of randomly selecting a family and finding that: 

(a) The family did not make a trip to an amusement park last year? 

(b) The family took two or more trips last year? 

(c) The family took three or fewer trips over a three year period? 

(d) Took exactly four trips over a six year period? 

Note: Answers can be found on p. 347. 


6.1 If the variable X, follows a Normal distribution with mean =100 and stand¬ 
ard deviation = 5, find the probability that: 

(a) P(X, > 107) (d) P(101 < X, < 109) 

(b) P(X, > 98) (e) P(91 < X ; < 94) 

(c) P(X, < 95) (f) P(93 < X ; < 106) 

6.2 Find, using the standard Normal tables (Z tables, p. 314), the value of Zj for 
each of the following: 

(a) P(Z > Zj) = 0.3300 

(b) P(0 < Z < Zj) = 0.4732 

(c) P(Z < Z,) = 0.8212 

(d) P(Z < Z,) = 0.5478 
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63 The mean height of 4 000 rhiu 

deviation of 7 cm Ass • ren m a town * s 130 cm w hh a standard 
how many children areTf 1 ^ ^ heightS t0 be norma Uy distributed, find 


(a) height over 144 cm 

(b) height less than 123 cm 

(c) between 126.5 cm and 137 cm 

(d) determine the upper and lower quartile heights. 


6 ' 4 for^n!T d fT7 ° f maU M \ large Ci ‘y firm follows a pattern con- 
. g e orma ' distribution, with a mean time of arrival of 9.40 a.m. 
and with a standard deviation of 20 minutes. 

Estimate the number of occasions during the 250 working days in the 
year when the mail arrives 


(a) before the main gates open at 9 a.m. 

(b) after the arrival of the office staff at 9.20 a.m. 

(c) between 10 a.m. and 10.20 a.m. 


6.5 The life of a certain type of machine component is known to be approx¬ 
imately normally distributed with mean 25 hours and standard deviation 
30 minutes. Calculate the probability that a randomly selected component 
lasts for: 

(a) between 24 and 24.5 hours 

(b) between 24.25 and 24.75 hours 

(c) more than 24.5 hours 

(d) less that 24 hours 15 minutes 

(e) If 28.1 per cent of the components last less than ‘K’ hours, find the 
value of K. 

6.6 An engineer designs a car seat to comfortably fit women above the height 
of 157.5 cm. The mean height of women in the UK is 160.9 cm with a 
standard deviation of 6.0 cm. What proportion of women will be excluded 
from buying this car? (Assume the height of women is normally distributed.) 

6.7 A tyre company has invented a revolutionary new product. In order to 
overcome consumer resistance a mileage guarantee is offered with the tyre. 
Road tests suggest that the mean life of the tyre is 42,000 miles, with a 
standard deviation of 4,000 miles. The tests suggest that tyre life is norm¬ 
ally distributed. 

(a) What percentage of tyres will last for more than 45,000 miles? 

(b) What should the guaranteed mileage be if the firm wishes to replace 
no more than 4 per cent of tyres? 

6.8 The amount of coffee that an automatic filling machine puts into nominal 
100 g jars, is normally distributed and varies from jar to jar. The machine 
fills 50,000 jars per week and is currently set to a mean fill of 100.50 g. All 
jars below the nominal weight are rejected. The current reject rate is 1,500 
jars per week and each reject costs the firm 80p. 
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(a) Find the standard deviation of the |ar weights 
,. f the reliability (standard deviation) of the machine cannot be 
chanced and the firm wishes to save £400 per week on reject costs, 
to W hat mean fill should the machine be set? 


6 9 Air Inexcel has a policy of booking as many as 16 persons for an aeroplane 
’ that seats only 14. Records show that only 85 per cent of booked passen¬ 
gers actually arrive for the flight. Find the probability that if Inexcel books 
16 people on a flight, enough seats will not be available. 


6.10 A firm has agreed to accept delivery of a component shipped in batches of 
1,000 if, in a random sample of ten, no more than one component is 
defective. What is the probability that this acceptance sampling rule would 
lead to a batch with 20 per cent defective items being accepted? 


6.11 Suppose that 40 per cent of all students have their own personal computer. 
If 24 students are selected at random, find: 

(a) the probability that eight will have computers; 

(b) the probability that eight will have computers (using the Normal 
approximation to the Binomial). Compare the results with part (a). 


6.12 An airline offers all 180 passengers on the London/Madrid flight a lunch 
choice of chicken kiev or beef in beer. As passengers choose once the flight 
is in progress, there is no way of knowing in advance how many will choose 
beef on any particular flight. The airline does not want to disappoint cust¬ 
omers but at the same time, wants to avoid the food waste and cost of 
carrying excessive meals. Historical evidence suggests that 65 per cent of 
passengers choose chicken when offered the choice. What is the probability 
that, on any particular flight, the number of passengers choosing chicken 
will be more than 125? (use the Normal approximation to the Binomial to 

evaluate). 


6.13 The reception desk in a hotel usually receives four guests per five minute 
period. Assuming that the conditions for a Poisson distribution hold, n 

~ . _•_1_l»r fniir miPStS 


periou. /\iouiiiiii 5 - 

(a) the probability that over a five minute period exactly four guests 
will arrive? 


will arrive/ 

(b) the probability that two or more people will arrive. 

(c) the probability that exactly six people will arrive over a ei 

period? 


6 14 A certain process produces 100 in long rolls of high quality silk, hi j' r * 
to assess quality a 10 m sample is taken from .m end oI cI^ 
inspected for blemishes. The number of blemishes In ,,r 

to follow a Poisson distribution with an avuagc 
10 m sample. Determine the probability that: 


III oi**i*|^*~’- 

«rss srrrssr rrr:». - 


fol/pn 
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6.15 The proportion of defective components produced by a firm is 0.6 per 
cent. The components are sold in boxes of 250 and the company guaran^ 
tees to replace any box containing more than three defectives. The cost o 
each replacement box is £4. The company is thinking about introducing 
a more comprehensive inspection scheme that would cost lOp per o 
but would eliminate all defectives. Use the Poisson approximation to 
Binomial to decide whether this inspection scheme is worthwhi e. 

Answers to Review questions can be found on pp. 419-424. 


Further study and data 


Texts , 

Bancroft, G. and O'Sullivan, G. (1993), Quantitative methods for accounting an 

uwl"’ "" f “ 

— 5th edn ’ “ 

(JTt 9 (1996), Essentia, Cements of business statistics, BPP, chapters 5 
Swift,! (2001), Quantitative methods for business, management and finance, 
ThomlfM! IwfQtiantitative methods for business studies, Prentice Hall, 
” n er (1997) Ouantttam methods for business, 2nd edn, Addison Wesley 

Waters, D. (199/j, ^ 

Longman, chapter 14. Founda tion quantitative methods for 

Wisniewski, M. with tea , • 

business, Pitman, chapter . 

Sources of information and data 

see the list a. the end of Chapter 1 (P- 25). 
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CHAPTER 


Sam 


pling and tests of hypothec 


Objectives 

When you have read this chapter you 
should be able to: 

► appreciate the key features of the 
different types of sample; 

► be aware of the properties of the 
distribution of sample means, e.g. the 
standard error, 

► apply the ‘Z’ score and tables to the 
distribution of sample means drawn 
from a Normal population; 

► understand the Central Limit Theorem; 

► apply the ‘Z’ score and tables to the 
distribution of sample means drawn 
from a skewed population; 

► understand and calculate confidence 
intervals for either the population mean 
or the sample mean; 

► know when and how to use the Student 
t distribution; 

► know when and how to use the 
Chi-squared distribution. 


Introduction 


Chapter 6 introduced the Normal distribution and the 
Z statistic. In this chapter we continue to apply these 
ideas, but in the context of sampling. Most firms would 
find it either impractical or too expensive to survey all 
their customers or to carefully examine every item that 
flows from their production line. Instead they usually 
resort to selecting a sample from the whole group or, as 
it is often called, the population. Of course here the terra 
'population' can refer to all the items under considera¬ 
tion as much as to all the people. 

We briefly consider the types of sample that might 
be selected, and when each type might be most appro¬ 
priate. We then go on to consider the distribution of 
sample means which, as we shall see, can under certain 
conditions be regarded as being normally distributed, 
thereby allowing us to use the Z table probabilities o 
Chapter 6. The Central Limit Theorem is suggesting 
that even when the 'population' is skewed, a randon’ 
sampling procedure using large samples can still res 
in a Normal distribution for the sample means. 

We move on to explore confidence limits, i- e - 

nf ™ ^ rangeS ° f ValUeS Within Which we can ' With Va tvU 

for 8 testing varin an observation to lie. This is used as 

for such tests the ° r hypotheses. When we use large sample ' ^ 

small samples („ < " 2 ) the taWeS a ' e StUI a PP r °Phate. However when « 

' number of „,he r H,K We 'he student t distribution'' > 

ie Chi-squared dlstributlon USeCl ^ t<?St hy P otheses are lnt roducei sl 
rhe term statistical h,fcrcr' n ': .- .ncedn lV 

In rU:.. . ' 


a 
the 


discussed in this ch u‘J" lir< ! UV ls often usocl to refer to the many P llK ‘ tio ii 
based on the results ^ZZn hereby tributes are ascribed to the ‘poP^ 1 
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7.1 Types of sample 
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tics^or^ttrU^T ° f sa ” lp ! e ma y be used > depending in part on the characteris- 
those undp t ^ ° u ^ P°P u ^ ati ons' to be sampled and on the objectives of 

of thP v, 8 SamplC - Heie We briefl y review some of the ke Y Matures 

of the various types of sample. 


Random sampling 

A random sample is used when the intention is to give each item in the 
population as much chance of being selected in the sample as every other 
item. A common way to conduct random sampling is to list all the 'members' 
or items in the population and then to use random number tables (merely 
a collection of random digits) or computer generated random numbers to help 
select the sample. Such procedures avoid various types of 'bias' which might 
creep into the selection of items to be included in the sample when undertaken 
by humans. Even the selection of numbers by individuals might be influenced 
by unconscious preferences for odd, even numbers, etc. 

The key objective of random sampling is to obtain a sample that fairly 
reflects the population as a whole. Random sampling is more likely to be used 
when the population itself is relatively homogeneous, i.e. is composed of items 
of broadly the same type. Of course there may be considerable practical difficul¬ 
ties in actually locating and interviewing each 'member' of the population that 
has been randomly selected. For example, if the chosen adult is not at home 
or otherwise available a researcher may have to return or seek to contact that 
individual on another occasion - another member of the household, etc. will 
not be acceptable. 

Suppose the random number table (see Appendix 4, p. 317) is used in 
seeking a random sample of, say, 40 items. This might involve linking together 
two columns of the five digits displayed in the table. These might begin 83635 
suggesting we choose the first listed item in the first column, then 66791, 

and so on. 


Random sampling 


► 

► 

► 

► 

► 


, h item selected has an equal chance of being drawn; 

'ally adopted when the population is largely homogeneous, i.e. when it 
difficult to distinguish between items; 

Cementation often involves the use of computer-generated random 
nnbers; 

lection is unbiased; 

“ '. dra wbach is that a population listing is required and the chosen 
„, c ■„ be located, then questioned or measured. 


THOUGHT 7.1 


_ , _ wpH-known products for which a random sampling 

«*** approprie,e ,or ,esme cus,omer ,esponse? 
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The random sampling procedure will also be important should we wish to 
apply the Central Limit Theorem (see pp. 150-152). 


Stratified sampling 

Of course there will be times when we might prefer a non-random sampling 
procedure. Such a non-random procedure might, for example, be preferred when 
the population is segmented or heterogeneous, i.e. when it contains very different 
sub-sets of items or subjects. If, say, 40 per cent of all adults have incomes over 
£x and 15 per cent have incomes below £y, then we might wish our sample of 
adults to exactly mirror such proportions, rather than give every adult an equal 
chance of selection. This is the basis of stratified sampling. Note that the 
intention here is still to obtain a representative sample, but one which 'fairly' 
reflects a population which itself is heterogeneous. We might say that such 
a sample is free from 'selective bias', since the proportion of any identified 
attribute in the sample is merely a reflection of its contribution to the popu¬ 
lation as a whole. 

Stratified sampling 

► used when the population has a number of identifiable attributes; 

► populations stratified in this way are known as heterogeneous; 

► the composition of the sample must reflect the attributes present in the 
population, e.g. the proportion of low-, middle- and high-income earners; 

► individuals or items within each stratum may still be selected randomly; 

► a stratified sample is free from selective bias, since it reflects the 
proportions of any given attribute present in the population as a whole. 


DID YOU KNOW? 

It costs five times as much to recruit a new customer 
via marketing strategies (e.g. sample surveys of new 
products/designs, etc.) as it does to service and retain 
an existing customer. 


DID YOU KNOW? 

By the year 2011 it is estimated that 17 per cent of the 
UK population will be aged over 65 years. This ‘ageing’ 
population will mean an increased demand for those 
products targeted toward the elderly. 


Quota sampling 

The use of quota sampling is widespread 
in market research. Here the intention is 
often to deliberately introduce selective 
bias into the samples, in the sense that 
attributes of the members or items selected 
will represent the choice of the samples 
rather than the attributes of the popula¬ 
tion as a whole. In this sense there is no 
attempt to seek a representative or unbi¬ 
ased sample from the population. For ex¬ 
ample, if a firm sells most of its product 


to those with incomes over £x, then the sample may contain HO per cent of 
such adults even it they only comprise, say, 40 per cent of the population. Nor 
is there usually any attempt to use random sampling within the quotas selected, 
as often happens within the different strata in stratified sampling. It is often left 
to the discretion ot interviewers, etc. to include specified numbers (quotas) of 
subjects possessing given attributes within the sample. To do so accurately 
may be costly, requiring highly trained interviewers. 


SAMPLING AND TESTS OF HYPOTHESES 147 

Quota sampling 

^ widely used in market research; 

sample includes a specified number or quota of subjects with given 
attributes; 

► interviewers must be highly trained as they are often responsible for 
identification and selection of respondents; 

► a biased sample therefore results, but one which may be useful in 
representing the customers seen as most likely to purchase the firms 
products. 


C an you identify three well-known products for which a quota sampling 
procedure might be considered appropriate for testing consumer response? 


Multi-stage and cluster sampling 

Other types of non-random sampling include multi-stage and cluster 
sampling. 

Multi-stage sampling 

► usually involves sampling of subjects with a given attribute; 

► often occurs where there is a wide geographical spread of such subjects 
which makes sampling expensive; 

► whole geographical area is therefore divided into regions; 

► a small number of such regions is then selected randomly; 

► regions selected are further broken down into sub-regions (e.g. North West 
England into parliamentary constituencies), from which a random sample 

is selected; 

► sub-regions selected are further broken down into units (e.g. towns or 
streets), from which a random sample is again selected; 

► eventually individual households or persons with a given attribute are 
identified in specific towns or streets; 

► costs of interviewing is then much reduced, though some risks of bias in 
procedure especially if only small numbers of regions, sub-regions, etc. 

are selected. 

Cluster sampling 

► items chosen in clusters rather than individually; 

► example, cluster might be all residents in a particular road or group of 

roads; . , , 

^ useful method where the population is widely spread geographically but 

where the various clusters are broadly representative of that population; 

► similar to above sampling method, except that the cluster itself is the 
single-stage involved. 
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DID YOU KNOW? 

If your target market were, say, boys between 11 and 14 
years, a recent government report into young people’s 
lives would provide useful marketing information. For 
example, boys between 11 and 14 years: 

have on average £2.30 pocket money per week; 
have as their favourite comics, magazines, games, 
and newspapers: 

► Viz (26 per cent of all boys) 

► Gamesmaster (24 per cent of all boys) 

► Beano (24 per cent of all boys) 

► Sega Power (23 per cent of all boys) 

► The Sun (22 per cent of all boys) 

have as their favourite TV programmes - Neighbours, 
Coronation Street, EastEnders, Home and Away 
(respectively). 

► have football as their main activity in free time 
(50 per cent of all boys) etc. 


7.2 Distribution of sample means 


If the distribution of the population (X,) 
is Normal then the distribution of the 
arithmetic means (X,) of samples of size n 
taken from that population will be Normal 
(see Figure 7.1). The new distribution will 
have the same arithmetic mean p but a 
different standard deviation. We call this 
the standard error (SE) and it can be found 
by dividing the standard deviation of the 
population (cr) by the square root of the 
sample size (i.e. 4n). 

Standard error (SE) = standard 
deviation of distribution of sample 
means 

SE = -y= or SE = -^= 

Vn yn 

Note: where the population standard 
deviation (o) is not available, the sample 
standard deviation (s) is often used as an 
approximation: 


figure 7.1 

Distribution of 
population for 
variable X, and for 
the means X, of 
samples of size n 
drawn from that 
population 
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Of course if the sample were, say, of size 1 then we would be back to individual 
members of the population X, and there would be no difference between SD 
and SE (SD = SE = <r/Vl = cr). However if the samples were of a size greater than 
1, then the samples themselves would tend to average out any extreme values of 
X;, so that the distribution of sample means (X,) could be expected to have a 
smaller dispersion around the population mean (/./) than would the distribution 
of each item of the population. The larger the sample size n, the smaller this 
dispersion of X, around fi is likely to be. For example, samples of size 36 taken 
from the population (X ; ) give an SE for the distribution of sample means (X,) of 
ct/V 36 which is smaller than the SE had the samples taken from the population 
only been of size 4, giving SE = o/a/4 . 


Using the Z statistic 

Because the distribution of sample means is Normal , we can still use the Z 
statistic. However note that we now calculate the Z statistic as. 


X rz£ 

o 

-Jn 


WORKED 
EXAMPLE 7.1 


A production line makes units which are normally distributed with a 
of 200 grams and a standard deviation (a) of 9 grams. What is the 
a sample of 36 units having a (sample) mean weight of 203 grams 


mean weight 
probability of 
or more? 


figure 7.2 


Solution 



200 203 

i _ i- 

o z, 
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H = 200 grains 
<r = 9 grams 


n = sample size = 36 
cr 9 


SE = 


Vn V36 


= 1.5 


z* = UL 

7fn 


203 - 2 00 
1.5 


= +2 


Using the Z tables (Appendix 1 , p. 314 ) 
Pi = 0.0228 


7 *3 Central Limit Theorem 

Even if the population is not normally distributed, the Central Limit Thee 

disnibuhon of a tb if SamP ! ing " rand ° m and ' he sam ? te W * * 32, the” h 
“ of ' he sam P‘ e means can be regarded as approximately normal!, 
distributed, with mean „ and standard error a/* as shown in Figure 7 3 

We can therefore still use our Z statistic for calculating probabilities for thi 
istribution of sample means even when the population is not itseif normallv 
distributed. All we need do is use a random sampling procedure and make sure 
that our sample size is large enough (at least 32). 


figure 7.3 

Central Limit 
Theorem: population 
skewed, but 
distribution of 
sample means ( X ,) 
Normal, with same 
mean (g) and 





means 
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WORKED 
EXAMPLE 7.2 


figure 7.4 


of 1 rrrf if ^ pane,s has a mean thickness of 4 cm and a standard deviation 

of thp g a , ran ° m sample of 100 glass panels is taken, what is the probability 
the sample mean having a thickness of between 3.9 cm and 4.2 cm? 


Solution 

Th n ^ a * theorem applies as random sampling has been adopted and n > 
e istribution of sample means (X,) is therefore Normal with mean (u) = 4 
cm and standard error c/Vn = 1 /a/ioo = 0.1 cm. 



Zi = 



3.9 - 4.0 
0.1 


-1 


P 1 = 0.1587 

A 1 = 0.5000 - 0.1587 



4.2 - 4.0 

Oil 


= +2 


P 7 = 0.0228 
A 7 = 0.5000 - 0.0228 
A 7 = 0.4 772 
A, + A 2 _= 0.8185 

i.e. there is an 81.85 per cent chance of the sample mean of 100 glass panels 
having a thickness of between 3.9 cm and 4.2 cm. 
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SELF-CHECK 

QUESTIONS 


7 1 It is thought that the annual cost of transport to and from work for 
employees of a company is normally distributed with mean £100 a year 
and standard deviation £10. What is the probability that a sample of 
100 employees will give a sample mean of. 

(a) less than £98 a year 

(b) more than £103 a year? 

7.2 A departmental manager believes that the average value of a large 

population of invoices is £10 with a standard deviation of £2. What is the 
probability that a sample of 36 invoices selected at random will give a 

sample mean: 

(a) below £9.50 

(b) above £10.75 

(c) above £11? 

7.3 An automatic machine fills jars of jam with a mean net weight of 340 grams 
and a standard deviation of 8 grams. What is the probability that a sample 
of 64 jars selected at random will have a sample mean: 

(a) above 340.75 grams 

(b) below 338.5 grams 

(c) between 339 grams and 342 grams? 

Note: Answers can be found on pp. 347-348. 


7.4 Confidence intervals 


A confidence interval is a range of values within which we can have a 
certain level of confidence (usually 95 per cent or 99 per cent) that a particular 
value of a variable will lie. In the context of sampling, we are usually interested 
in one of two possible types of confidence interval. 

Confidence interval for the sample mean 

This is the range of values around the population mean (p) within which vve 
can be 95 per cent or 99 per cent confident that a particular sample mean (V 
will lie. 

95 per cent confidence interval 

In terms of Figure 7.5, we wish to find the values for X { and X>, which are 47.a 
per cent (0.475) either side of the population mean //. 

Of course this will imply 2.5 per cent (0.025) in each tail of the distribution 
for X,. 

We have already seen (p. 130) how we calculate the Z, and Z, scores (+1-99 
and -1.96 respectively) which will give us 0.025 in each tail. We can then use 
our formula tor the Z statistic (as applied to the distribution of sample means) 
to calculate X, and X 2 : 





+ 1.96 




(j / 


V/ 


= x 1 -n 


Id + 1.96 



= *i 


We can use the same procedure for finding Z 2 , except that we insert -1.96 for 


the Z 2 score. You should then be able to show that: 


H - 1.96 


\ 

cr/_ 
V/ xn) 


= X 7 


The end values of the confidence interval are called the confidence limits. Here 
X, and X 2 are the confidence limits for the 95 per cent confidence interval. 


In general, the 95 per cent confidence interval for the sample mean can be 

written as follows: 

% = /d ± 1.96 (ct/a fn) 

i.e. X, = H ± 1-96 SE _ 

In other words we can be 95 per cent confident that a single sample mean (X,) 
will lie within 1.96 standard errors (SU) of the population mean //. 


99 oer cent confidence interval 

The approach here is exactly the same as above except that we wish to find the 
values for the distribution of sample means, namely A, and .V„ winch are 49.5 
nor rent (0 495) either side of the population mean p. Of course this will impl» 
, per cent (0.005) in each tail of the distribution for 


.V 
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As we shall see in Figure 7.6(b) below, the only difference will be that 
Z, and Z 2 scores respectively will be +2.58 and -2.58 in order to give 0 0 (K° Ur 
each tail. ' m 


WORKED 
EXAMPLE 7.3 


figure 7.6 


In general the 99 per cent confidence interval for the sample mean can be 

written as follows: 

X = p ± 2.58(cr/-\/n) 
i-e. X, = p± 2.58 SE 


In other words we can be 99 per cent confident that a single sample mean (X) 
will he within 2.58 standard errors (SE) of the population mean (/i). 

The end values of this 99 per cent confidence interval, X x and X 2 will be 
the 99 per cent confidence limits. 


A large number of random samples of size 100 are taken from the production line 

for glass panels which is thought to produce panels with mean thickness 4 cm 
and standard deviation 1 cm. 

Find (a) the 95 per cent and (b) the 99 per cent confidence intervals and 
associated confidence limits for the sample mean. 


Solution 



(a) X, =/i ± 1.96(rWn) 

X = 4 ± 1.96(1/7100) 

X = 4 ± 1.96(0.1) 

X - 4 ± 0.196 
X = 4.196 and 3.804 
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The 95 per cent confidence interval is 3.804 cm to 4.196 cm 
In terms of Figure 7.6(a) the confidence limits are: 

X r = 4.196 cm 
X 2 = 3.804 cm 
(b) Xj — n ± 2.58 (a/Vn) 

Note: if you check the Z tables in Appendix 1 (p. 314) you will see that 2.575 
is strictly the Z score giving 0.005 in each tail. Rounding up the 5 in the third 
decimal place gives 2.58. 

X,, = 4 ± 2.58 (l/s/lOO) 

x; = 4 ± 2.58 (0.1) 

X, = 4 ± 0.258 

The 99 per cent confidence interval is 3.742 cm to 4.258 cm. 

In terms of Figure 7.6(b) the confidence limits are: 

Xl = 4.258 cm 
X 2 - 3.742 cm 

Confidence interval for the population mean 

This is the range of values around the sample mean (X,) within which we can be 

nr npr rent or 99 per cent certain that the population mean (p) will lie. 

^ P , __in thiQ le wp are likelv to 


Th. nrpvious analysis applies here except that in this case we are likely to 



sample means: 


a s 


a 


i.e. SE = —r= - t- 

In Vn 

xu ^ the 7 etatistic for the distribution of sample means is 
We have seen that the Z statistic un 


We have seen 


i.e. SE = —r= 
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All we have done is re-arrange the terms to have p on the left-hand side this 
time instead of X,. As before the confidence intervals for population 
mean can be written as follows: 


fi = X,± 1.96 



for a 95% confidence interval 


i u = X,± 2.58 



for a 99% confidence interval 


WORKED 
EXAMPLE 7.4 


Estimate (a) the 95 per cent and (b) the 99 per cent confidence intervals and 
associated confidence limits for the population mean when we are given the 
following sample data: 

sample mean (X,) = 950 kg 
sample standard deviation (s) = 15 kg 
sample size (n) = 36 


Solution 


(a) 95 per cent confidence interval and confidence limits for p 
p = Xj ± 1.96 (s/Vn) 
p = 950 ± 1.96(15/^36) 
p = 950 ± 1.96(2.5) 
p = 950 ± 4.9 
p = 945.1 kg to 954.9 ke 

In other words we can be 95 per cent confident that the population mean p 

W ' -h 1? Lf in1:erval l3etween 945.1 kg and 954.9 kg, given the sample data 

nn° 1 1 6 f 6Se va ' ues are P er cent confidence limits for the 

population mean. 


(b) 99 per cent confidence interval and confidence limits for p 
» = X,± 2.58(s/Vn) 
p = 950 ± 2.58(15/V36) 
p = 950 ± 2.58(2.5) 

P = 950 ± 6.45 


tL-': vm.Vo kg to 956.45 kp, 

... ■" - 

values are th<> qq kt> d 956-45 kg> S |Ven the sam P |e data. These 
e ,he " p<!, cenl confidence limits to, the population mean. 
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SELF-CHECK 

QUESTIONS 


ACTIVITY 5 


™ nagement °| a company claim that the average weekly earnings 
eir employees is £550 with a standard deviation of £120. A random 
sample of 144 employees is taken. Calculate (a) the 95 per cent and 
(b) the 99 per cent confidence intervals for the sample mean. 

7.5 A random sample of 64 households was selected in a town and the average 
expenditure per week on a certain product was found to be £6.15 with a 
(sample) standard deviation of £1.30. Calculate (a) the 95 per cent and 
(b) the 99 per cent confidence intervals for the population mean. 

Note: Answers can be found on p. 349. 


Confidence interval for p when s is unknown 

Two random samples of components were selected from a production process 
with the intention of estimating their life expectancy in hours. The first random 
sample contained 60 components, the second random sample contained 120 
components. Estimate the 90 per cent, 95 per cent and 99 per cent confidence 
intervals for the population mean from each sample. 

The data from the samples were as follows: 


Sample 1 


286.5 

304.5 

310 

310 

300.6 

296.5 

314.4 

307.2 

293.7 

302.4 

287.5 

310.1 

298.5 

299.7 

304.6 

302 

301.9 

297.2 

292.9 

294.6 

Sample 2 

297 

311.9 

310.5 

301.2 

309.8 

295 

298.6 

294.9 

291.9 

305.8 

295.2 

321.6 

291.5 

301.2 

286.6 

290.1 

297.4 

297.4 

309.6 

295.4 

291.4 

298.1 

297.7 

302.3 

304.8 

296.7 

304.4 

304.8 

306.2 

304.4 


301.3 313.9 

306.4 288.4 

304.6 291.5 

299.8 316.1 

312.4 302.1 

305.5 295.6 

307.7 304.3 

290.8 314 

298.2 302.1 

296.2 306.5 


301.4 305 

313.4 288.4 

301.3 303.7 

292.2 301.6 

290 300 

297.9 303.9 

310.4 299.4 

305.7 306.2 

317.2 302.4 

309.8 299.4 

292.7 292.6 

293.2 289 

297.4 290.2 

303.2 299.1 

297.2 296.6 


309.7 

287.4 

310.4 

285.7 

299.2 

296.7 

287 

307.9 

296.4 

292.7 

291 

301.2 

292.6 

296.2 

298.5 

291.6 

295.2 

299.7 

284.2 

304.6 


305.1 

309.7 

301.8 

295.2 

304.3 

290.2 

309.2 

305.7 

298.5 

302.1 

293.9 

300.6 

308.9 

299.6 

304.8 

301.6 

305.5 

290 

289.5 

307.8 

277 

301.8 

300.1 

303.8 

299.7 

303.1 

290.8 

304 

315.7 

302.6 


295.1 319.4 

305.8 289.2 

298.3 304.7 

312.8 301.2 

301.8 299.1 

301.5 311.3 

291.1 287.7 

299.3 307.9 

313.6 292.1 

299.1 296.1 

291.5 308.6 

297.2 276.4 

288.6 293.9 

286 308.2 

297.9 288.3 


1 


Enter all the data from 
into column B. 


Sample 1 into column A and the data from Sample 2 
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Analyse the two samples by setting out a table of results as follows: 



D 

£ 

F 

2 

SAMPLE 

1 

2 

3 

Size 



4 

Median 



5 

Mean 



6 

Std Dev 



7 

Std Error 



8 

Variance 



9 

Min 



10 

Max 



11 

Range 




3 Comment on your table of results. 

s 

4 Calculate the standard error by using: SE - —= 

Vr? 

5 Applying the Central Limit Theorem we can say that for 95 per cent: 
s 


H = X± 1.96 


rn 


For 90 per cent and 99 per cent: 

fi = X ± 1.645 ~ and p = X ± 2.575 respectively. 

Vn Vn 

6 Use the above formulae to find the respective confidence intervals. It is 
suggested that another table, such as the one below, will produce a clea, 
layout for your solutions. 


Conf level % 

Z Z*E Lower Upper Interval 

Sample 1 

90 

1.645 

95 

1.96 

99 

2.575 

Sample 2 

90 

1.645 

95 

1.96 

99 

2.575 

Comment on your results. Check with the responses on p. 468. 

7.5 Tests of hypotheses: principles 

We can use our 

work on the distribution of sample means 


Limit Theorem for testing hypotheses (theories). A hypothesis is an assump¬ 
tion about a situation. We usually want to test this assumption against one or 
more alternative assumptions. 
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It will be useful at this staap u 

widely used in this tonic ^ . ^ ecome familiar with the terminology 

main hypothesis (Null h™Bhesi? t H 1 r Pli ? ty We aSSUme that We are testing 3 
hypothesis, H,) a § ainst °ne other hypothesis (Alternative 


Null hypothesis <H„): the hypothesis to be tested. 

For example that the mean thickness of glass panels is 4 cm 
H 0 : p = 4 cm 


Alternative hypothesis 

the null hypothesis. 


(Hi), the hypothesis we wish to accept if we reject 


xample that the mean thickness of glass panels is not equal to 4 cm 
Hj : p * 4 cm 


In testing our null hypothesis, there are essentially two types of error we might 
wish to avoid. 


Type 1 error: rejecting the null hypothesis when it is, in fact, true. 

^ T yP e ^ error: accepting the null hypothesis when it is, in fact, false. 

Avoiding a Type 1 error is usually the main concern of problems in this topic 
area. When testing the null hypothesis we usually state the maximum risk we 
are willing to take of committing a Type 1 error, i.e. of rejecting the null 
hypothesis when it is, in fact, true. This is called the level of significance and is 
typically either 5 per cent (0.05) or 1 per cent (0.01). 


Levels of significance 

The maximum risk we are willing to take of making a Type 1 error, i.e. of 
rejecting the null hypothesis when it is, in fact, true. 


Confidence limits and critical values 

If we look back to Figure 7.6 (p. 154) we can usefully illustrate the idea of the 
level of significance. It is clear that there is a 95 per cent (0.95) chance of any 
sample mean ( X ,) lying within 1.96 standard errors (SE) of the true population 
mean p. However there is still a 5 per cent (0.05) chance of a single sample 
mean lying outside this 95 per cent confidence interval even though the popula¬ 
tion mean really is p. 

Here this 5 per cent (0.05) chance is split evenly to be 2.5 per cent (0.025) 
in each tail. If the sample mean lies outside the confidence limits X, or X, then 
our decision will be to reject H 0 even though we might be wrong in rejecting 
H„. However the chance of our being wrong is less than 5 per cent, since there 
was a less than 5 per cent chance of actually getting a sample mean X, outside 
X, and X 2 when p really is the population mean. 

These confidence limits for the population mean, p, can therefore be re¬ 
garded as the critical values for tests of hypotheses. In other words they are the 
values of X, (or their Z t equivalents) outside of which we are willing to reject the 
null hypothesis (H„). In that sense they are the values 'critical' to the decision 
we are about to take in our test of the null hypothesis (HJ. 
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figure 7.7 

Acceptance and 
rejection regions for 
the null hypothesis 
(H 0 ): two tailed tests 
at (a) 5% (0.05) 

(b) 1% (0.01) levels 
of significance 


Critical values 

These are the values of X, or Z, for the sample outside of which we are willing to 
reject the null hypothesis (H 0 ) at a particular level of significance. 

Two- and one-tailed tests 

Two-tailed tests 

The approach so far has involved a two-tailed test, i.e. where we split the risk 
of being wrong in rejecting the null hypothesis equally between each tail of % 
distribution. 

Let us formally set out the null (H 0 ) and alternative (H,) hypotheses f 0r 
two-tailed test. We use our earlier example of thickness of glass panels by way 0 f 
illustration. 

H 0 : p. = 4 cm 
Hi : p * 4 cm 

Since the alternative hypothesis (Hi) says p * 4 cm it is clear that we are just as 
interested in situations where p > 4 cm as we are in situations where p < 4 cm. 

Two-tailed test 

We divide the risk we are willing to take in making a Type 1 error equally 
between each tail of the distribution. 


If we test the null hypothesis at a 5 per cent (0.05) level of significance, we 
place 2.5 per cent (0.025) in each tail. Here we will reject H 0 if we think that 
p > 4 cm or if we think that p < 4 cm. 

Acceptance and rejection regions 

► The region within which we are willing to accept the null hypothesis (H 0 ) 
we call the acceptance region. 

► The region outside which we are willing to reject the null hypothesis (H^ 
we call the rejection region. 


F' 



0.05 level of 
significance 
(2-tail test) 

F' 

i 


0.01 level of 

significance 

j2iai[testL 


0.025 / 


0.025 


0.005 / 


0.005 

\ 

X 


-- 

J 

*** 


X 2 h 

X, 

— ► 


1.96 SE i.96 SE 

- _._ ^ ^ 


2.58 SE 2.58 SE 

Rejection Acceptance 
region region 

(a) 

—--- 

Rejection 

region 

---- 

Rejection Acceptance 

region region 

(b) 

Reject' 1 ’" 

regie ' 1 
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boundarvhT 116 ^ 0 ^ t0 ^ cr ‘ t * ca ^ values, i.e. the sample means at the 
/it \ 6 ween * e acceptance and rejection regions for the null hypothesis 

V n o;- 


WORKED Find the critical values for a two-tailed test at the 5 per cent (0.05) level of 

EXAMPLE 7. significance for the earlier example of glass panels, mean (p) = 4 cm, standard 

. error (cr/Vn) = 0.1 cm. 

Solution 

A 1 ± 1-96 SE = critical values 

4 ± 1.96(0.1) = critical values 
3.804 cm = X 2 ; 4.196 cm = 

We can be 95 per cent certain of getting a sample mean in the acceptance region 
3.804 cm to 4.196 cm if the true (population) mean is 4 cm. 

If, however, we get a sample measurement outside these critical values 
(i.e. in the rejection region), then we can reject H 0 : p = 4 cm with less than a 

5 per cent (0.05) chance of being wrong. This is because there is a less than 

5 per cent (0.05) chance of X • being outside these critical values if H 0 : p - 4 cm 
is true. 


WORKED Find the critical values for a two-tailed test at the 1 per cent (0.01) level of 

EXAMPLE 7.6 significance for the earlier example of glass panels, mean (p) = 4 cm, standard 

. error (cr/Vn) = 0.1 cm. 

Solution 

This time we place half of 1 per cent (0.005) in each tail, giving a Z score of 
±2.58. 

p ± 2.58 SE = critical values 
4 ± 2.58(0.1) = critical values 
3.742 cm = X 2 ; 4.258 cm = 

We can be 99 per cent certain of getting a sample mean in the acceptance region 
3.742 cm to 4.258 cm if the true (population) mean is 4 cm. 

If however, we get a sample measurement outside these critical values (i.e. in 
the rejection region), then we can reject H 0 : p = 4 cm with less than a 1 per cent 
(0.01) chance of being wrong. This is because there is a less than 1 per cent 
chance of X, being outside these critical values if H 0 : p = 4 cm is true. 


One-tailed test ...... ... 

Here we do not split the risk of being wrong in rejecting the null hypo¬ 
thesis equally between each tail of the distribution. Here we are only concerned 
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with one side (and therefore one tail ) of the distribution when testing our null 
hypothesis (H„) against an alternative hypothesis (H,). We therefore place all 
the risk of making a Type 1 error in one tail. 


One-tailed test 

We place the risk we are willing to take in making a Type 1 error in one tail of 
the distribution. 


Let us formally set out the null (H 0 ) and alternative (H,) hypotheses for a one- 
tailed test. We again use our earlier example of thickness of glass panels by 
way of illustration. 

EITHER 

H 0 : n - 4 cm 
Hi : /I > 4 cm 

In testing H 0 against Hi we are only interested in rejecting H 0 (i.e. accepting H,) 
if the sample means suggest a true (population) mean greater than 4 cm. 

OR 

H 0 : ju = 4 cm 
Hi : n < 4 cm 

In testing H 0 against Hi we are only interested in rejecting H 0 (i.e. accepting H,) 
if the sample mean suggests a true (population) mean less than 4 cm. 

Acceptance and rejection regions 

Figure 7.8 indicates the relevant acceptance and rejection regions for a one-tail 
test at 5 per cent (0.05) and 1 per cent (0.01) levels of significance. Remember 
the Z scores are calculated using the Z tables (Appendix 1, p. 314) and are used 
to give the area in the respective tails. 


WORKED 
EXAMPLE 7.7 


Look again at Figure 7.8(a), p. 163. Find the critical values for a one-tailed test at 
the 5 per cent (0.05) level of significance for the earlier example of glass panels, 
mean (/j) = 4 cm, standard error (cr/Vn) = 0.1 cm. 


Solution 

Xj= n + 1-65 SE 
X, = 4 + 1.65(0.1) 
X, - 4.165 cm 

X 2 = \i - 1.65 SE 
X 2 = 4 - 1.65(0.1) 
R 3.835 cm 
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figure 7.8 

Acceptance and 
rejection regions for 
the null hypotheses 
(Ho)' one-tailed tests 
a t°(a) 5% (0.05) 

(b) 1% (0.01) levels 

of significance 




Acceptance 

region 


^ ;-;— -— - 

Rejection Rejection Acceptance 

region region region 

(a) 



Acceptance Rejection 

region region 

(b) 



2.33 SE 


- 

Rejection Acceptance 

region region 


If the sample mean X, lies outside these critical values then we would be outside the 
acceptance region and in the rejection region. 

► For H 0 : p = 4 cm 

Hi : p > 4 cm 

Reject H 0 if X > 4.165 cm 

► For H 0 : g = 4 cm 

Hi : g < 4 cm 

Reject H 0 if X < 3.835 cm 


7.6 Tests of hypotheses: practice 

In this section we develop a simple seven-step plan for tackling all problems 
involving tests of hypotheses. This seven-step plan draws on the ideas presented 
in the previous section. 

However before we outline these steps let us select one of two alternative 
approaches to finding critical values. 
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figure 7.9 

Using the Z scores 
for the critical 
values 

(a) Two-tailed test 

(b) One-tailed test 

(c) One-tailed test 


Critical values: Z scores or sample mean values 


As we saw in Figure 7.8 (p. 163), the critical values can be expressed in terms 
of the sample means , X x and X 2 respectively. However they can just as easily be 
expressed in terms of the Z scores, i.e. the number of standard errors from p, the 
population mean. 

In Figure 7.9, a (alpha) is the level of significance at which the test is con¬ 
ducted. The Z scores are shown for the respective critical values. These Z scores 
can be said to depend on both the level of significance and on whether the test 
is one- or two-tailed. 


In this approach we calculate the Z-score using the sample data provided. U 
t e score for the sample data lies outside the relevant Z critical values, then 
the decision rule will be to reject the null hypothesis (H 0 ). 

this“^"^;;^ th ; ough a worked example betore usin ' 

uur seven step plan for testing hypotheses. 


WORKED 
EXAMPLE 7.8 


It is thought that the average w/nac* -f 

standard deviation of £30 a Q g , f 8 8rge com P an y is £130 a week 
to be £123. e -^0 employees finds the sample 


Solution 

Test H 0 : p = £ 13o 
H i : A * £130 
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USin l^ 5 . Per Cent level of significance, 
expressed'as r taHed ** SinCe the alternative hypothesis (HJ has been 

critical value, then reject* f °' ^ Sample data and found to be outside the Z 


figure 7.10 



Here X = 123, 


gives Z = 


X-p 



123 - 130 
30/VlOO 

123 - 130 



Z = -2.33 


Our Z score is outside the Z critical value of -1.96, so we reject H 0 . 


We can now formally set out our seven-step plan for testing a hypothesis, using 
this example to illustrate each step. 
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WORKED 
EXAMPLE 7.9 


figure 7.11 


METHODS FOR BUSINESS AND ECONOM 


Step 

1 State hypotheses 

2 State significance level 

3 State critical values 

4 Calculate the Z score for the 
sample 

5 Compare this Z sample score 
with the Z critical values 

6 Come to a conclusion 

7 Put your conclusion in words 


Example 


H 0 : P = £130 
H x : p * £130 
5% (0.05) 

Z 1 = +1.96 
Z 2 = -1-96 
Z = -2.33 

Here Z = -2.33 is outside Z 2 = -1.96 


Reject H 0 ^ iU .. 

The sample evidence does not support the null 

hypothesis at a 5 per cent level of significance 


Note: A sketch diagram will usually help in Step 3 (e.g. Figure 7.9(a) for a 0.05) 


It will be useful for you to see this seven-step plan applied to a couple of 
examples, before trying it out yourself. 


A manager of a health store believes the average amount spent per week on 
vitamin C supplements to be £2.50. A random sample of 100 shoppers found an 
average expenditure on such supplements of £2.40 with a standard deviation of 
£0.40. Test the manager’s belief at a 5 per cent level of significance. 

Solution 

Note: We shall use a two-tailed test here, since any divergence from £2.50 
(above or below) would invalidate the manager’s belief. Since only the sample 
mean (s) is available, we use this instead of the population mean (cr) in 
calculating the standard error. 

Step 1 H 0 : p = £2.50 
H ± p * £2.50 

Step 2 5 per cent (0.05) level of significance 
Step 3 Critical values for Z: ±1.96 





















WORKED 
EXAMPLE 7.10 


fi gure 7.12 
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Step 4 For sample data 


II 

' 

'XI 

2.40 - 2.50 

ATn 

0.40/ 

/ VlOO 

-o.io 

-0.10 

0.40/ 

/io 

0.04 ~~ 2 


-2.50 is outside -1.96 
Reject null hypothesis (H 0 ) 

Evidence does not support the health store manager’s belief that average 
spending on Vitamin C supplement is £2.50 per week. 


Step 5 
Step 6 
Step 7 


The expected mean diameter of a batch of cables is 2 cm. A quality control 
inspector takes a sample of 64 cables and finds that the sample has a mean 
diameter of 1.94 cm and a standard deviation of 0.4 cm. Test whether the 
manufacturing process is now producing cables with a smaller diameter than 
previously at a 1 per cent level of significance. 


Solution 

Note: We shall use a one-tailed test here, since our concern is that the 
manufacturing process is producing smaller diameter cables than claimed. Again 
we use sample standard deviation (s) as an approximation for the population 
standard deviation (cr). Our test is at the 1 per cent level of significance this time; 
i.e. we are less willing than in the previous worked example to take the risk of 
rejecting the null hypothesis (H 0 ) when it is in fact true. 

Step 1 H 0 ; p = 2.00 cm 
H x : p < 2.00 cm 

Step 2 1 per cent (0.01) level of significance 
Step 3 Critical value for Z: -2.33 
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SELF-CHECK 

QUESTIONS 


ACTIVITY 6 


Step 4 For sample data 


Z = 


X-f* 



1.94 - 2.00 



-0.06 _ -0.06 

0.4/" " 0.05 
/8 


Step 5 -1.2 is inside -2.33 
Step 6 Accept null hypothesis (H 0 ) 

Step 7 There is insufficient evidence from the sample data to reject the initial 
hypothesis that the manufacturing process is still producing cables of 
diameter of 2 cm. We would be taking more risk than we are willing to 
take if we were to accept the alternative hypothesis that the cables are 
now being produced with a diameter of less than 2 cm. 


You can now test your familiarity with this seven-step approach with the follow¬ 
ing Self-check questions. There are also more questions for practice (Review 
questions) at the end of this chapter. 


7.6 A firm manufactures light bulbs and the sales manager claims that tests 
have proved that the average life of these bulbs is 1,000 hours. We wish 
to buy some of these light bulbs but before doing so we wish to check 
the claim that their average life is correct. We therefore take a sample of 
36 bulbs and test them; we find that the average life of the bulbs in the 
sample is 940 hours with a standard deviation of 126 hours. Is the 
manufacture’s claim justified? Test at both (a) 5 per cent (0.05) and 

(b) 1 per cent (0.01) levels of significance. 

7.7 A factory manager asserts that the mean running time of machines is 
14 hours a day. A random sample of 64 machines shows that their mean 
running time was only 13 hours 20 minutes, with a standard deviation of 
3 hours. Test the manager’s assertion at the 5 per cent significance level. 

Note: Keep the units the same throughout (i.e. hours or minutes). 

7.8 A machine produces electronic components whose lifetimes have a mean 
of 100 hours and a standard deviation of 14 hours. A new technique is noW 

'??n! ICed ' nt0 th6ir production - A sample of 49 components gives a mean 
of 104 hours. Has the new technique produced longer lasting compone" ts? 
Test at a 5 per cent level of significance. 

Note: Answers can be found on pp. 349 - 351 . 


Hypothesis testing or significance testing 

A company believes that the average amount spent by customers has changed 
from its original value. The accounting department, however, disputes this 
and maintains that sales remain at £1.48 per customer. To support their view the 

accounting department quote the following random sample of spending (£> •» 75 
customers. 
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1.61 

1.16 

1.72 

1.11 

1.58 

1.44 

1.37 

1.60 

1.71 

1.10 

1.50 

1.87 

1.54 

1.09 

1.41 

1.52 

1.59 

1.10 

1.40 

1.60 

1.68 

1.30 

1.69 

1.55 

1.59 

1.54 

1.80 

1.55 

1.52 

1.37 

1.15 

1.44 

1.43 

1.81 

1.53 

1.03 

1.61 

1.72 

1.29 

1.35 

1.09 

0.88 

0.90 

1.06 

1.83 

1.80 

1.33 

1.91 

1.31 

1.42 

1.86 

1.62 

1.71 

0.95 

1.58 

1.45 

1.59 

1.79 

1.01 

1.13 

1.53 

1.46 

1.67 

1.67 

1.53 

1.75 

0.94 

1.01 

1.42 

2.07 

1.23 

1.23 

1.41 

1.46 

1.19 







Use a spreadsheet with which you are familiar to conduct the following tests. 

1 Test, at the 5 per cent significance level, the hypothesis that the average 
amount spent by each customer remains at £1.48. 

2 Test the same hypothesis at the 1 per cent significance level. 

Conduct the hypothesis test: 

Step 1 State the hypothesis: H 0 : p = £1.48: : p * £1.48 


F 

G H 

/ 

2 H 0 : 

Average equals 

1.48 

3 H ± : 

Average is not equal 

1.48 


Step 2 State the significance level: 0.01, 0.05 or 0.1 (1%, 5% or 10/6) for this 
example use 0.05 


F G 

5 Sig level 0.05 

Step 3 State the critical values 


H 


7 

8 


Critical values 


Lower 

=N0RMSINV($G$5/2) 


Upper 

=l-NORMSINV($G$5/2) 


Step 4 Calculate the Z statistic using the formula: 
X - p 


z = 


s 

■in 


sample mean - hjpotfiesis edjn^ 

standard error 


10 Z statistic 


= (D4-I2)/D6 


+ . 7 value to the critical values. 

Step 5 Compare the l between the critical values then 

Step 6 Come to a conclusion If the L va 

accept the null hypothesis that p - Ll* 
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The decision on whether to accept or reject the null hypothesis can als 

by the computer using the logical operators IF(), and 0R(). ° be made 

The IF function returns one value if the logical_value evaluates to TRUF h 
another value if it evaluates to FALSE i.e.: and 


IF(logical_test, vaiue_if_true, value_if_false) or 
=IF(True, Then return this, Else return this ) 


The OR function returns TRUE if one or more of the 
FALSE if all the arguments are false. 


arguments is true and returns 


=OR(logical 1 , logical 2) 

These two functions can be combined to make the decision - thus: 


12 Compare =IF(0R(G10<G8, G10>H8), ‘REJECT’,‘ACCEPT’) 


This can be translated as: 

If the value in cell G10 is less than that of G8 OR greater than that of cell H8 
then return REJECT else return ACCEPT. 


Now try changing the value in cell G5 to 0.1. 

This construction can only be used for two-tailed tests although combinations of 
the logical operators can be used to develop both one-tailed tests and two-tailed 
tests. A response to this activity can be found on p. 469. 


7.7 Student t distribution 


For repeated small samples (n < 32) it is no longer accurate to assume that the 
distribution of sample means follows a Normal distribution. As a result we can 
no longer use the Z tables to work out the probabilities. Instead, we assume that 
the distribution of sample means follows the Student t distribution (called ‘t’ 
distribution for simplicity). This distribution actually changes as the sample size 
changes. We must therefore apply different probabilities to samples taken with 
different sizes. The concept of degrees of freedom (v) captures this idea, where the 
number of degrees of freedom is expressed as v = n - 1, where // is sample size 
(see Figure 7.13). 

We can illustrate this idea using 24 per cent (0.025) in the tail of the t 
distribution: 


v (11 - U 0,025 


1 

2 

5 

10 

MO 


12.700 
4. MOM 
2.571 
2.228 
2.042 
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figure 7.13 

Showing how the 

distribution of 

sample means (X,) 
expressed in terms 
of standard errors 
(f), takes different 
shapes depending 
on the number of 
degrees of freedom 
v(=n- 1) 


figure 7.14 

Showing how the f 
score for 2§-% 

(0.025) in the right- 
hand tail will vary 
depending on the 
number of degrees of 
freedom 
v (= n - 1) 
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Notice that the critical values (i.e. number of standard errors from the mean) 
vary depending on the value of v (= n - 1). So with a sample size (n) of 3 and v 
(_ „ _ i) 0 f 2 , we must be 4.303 standard error from the mean to get 21 per cent 
(0.025) in either tail. However with a sample size (n) of 11 and v (= n - 1) of 10, 
we need only be 2.228 standard errors from the mean to get 24 per cent (0.025) 

in either tail. 


Critical values 

We express the critical values for the t distribution as 

t(X,V 

where « = level of significance shown in the tables 
v = number of degrees of freedom 

So for a = 0.025, and v = 2 in the table above, we can write 
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WORKED 

EXAMPLE 


f«,v — fo.025,2 — 4.303 
and when v = 10 

t a ,v = ^o.o 2 s,io = 2.228 


t statistic 

We calculate the t statistic or t score exactly as we did for Z, i.e. we use the 
sample data to find the number of standard errors from the mean: 



or 


t = 


Xi-P 



As before we use the population standard deviation (a) or the sample standard 
deviation (s) for calculating t, depending on the information we are given in 
the question. 


t tables 

The full t tables to be used in finding the appropriate critical values can be 
found in Appendix 5, p. 318. 

It will be useful to go through our seven-step plan for a problem involving 
small samples and therefore the t distribution. Our steps are exactly as before, 
except in the way we calculate the critical values. 


A company requires batteries for its transport vehicles that last on average 
.11 20,000 kilometres before replacement. A battery company claims that it can 

. supply batteries that meet this requirement. The buying company decides to test 

that claim and buys 16 batteries and finds that they last an average of 19,500 
kilometres with a standard deviation of 1,200 kilometres. Test the suppliers claim 
against the buyers concern that batteries last less than 20,000 kilometres, at a 
5 per cent level of significance. 

Solution 

Note: This is arguably a one-tail test because the buyer is only worried if the 
batteries fall below the claim of 20,000 kilometres. The sample size is less than 
32, so it is a t test example. We use s rather than a in calculating standard errors. 

Step 1 H 0 : p = 20,000 km 
Hi : p < 20,000 km 
Step 2 a = 0.05 
Step 3 v = n - 1 
= 16-1 
= 15 

to.05,15 = -1.753 
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figure 7.15 



Step 4 



(here s for a) 


19,500 - 20,000 
1,200/ 

/W16 


-500 

300 


-1.667 


Step 5 -1.667 > -1.753 
Step 6 Accept H 0 

p = 20,000 km 

Step 7 There is insufficient evidence for us to conclude that the batteries perform 
less well than is claimed. 


SELF-CHECK 

Questions 


7 9 It is believed that the mean time that third year students spend in the 
library per week is 18 hours. Assume that the time spent in the library is 
normally distributed. A random sample of 25 students gives a mean time of 
18 6 hours with a standard deviation of 1.02 hours. Test the view that the 
mean time spent in the library is greater than 18 hours at the 5 per cent 

significance level. 


. suppose we want to test, on the basis of a random sample of eight 

kea+hor the average fat content of a certain make of ice cream 
samp., mean for ,a, content is found to he 

l' cent with a sample standard deviation of 0.96 per cent. What 

co/cluslon would you com. to at the 1 per cent significance level? 

„ nrnHiictlon line produce an average of 154 faulty items 
W ° rk hlft °A training package has been developed and tested on a random 
sample of 20 workers. The twenty workers in the sample produce an 
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average of 141 faulty Items with a standard deviation of 12 items. The 
average or x t0 aM workers only if it can be shown 

dimples worker performance. Based on the sample should the 
Z proceed with the programme? Test at the 1 per cent level of 

significance. 

Note: Answers can be found on p- 351. 


Chi-squared test 

The Chi-squared (x 2 ) test looks not at an individual item of data (i.e. a single 
parameter) but at the whole distribution. As a result it is known as a non- 
parametric test, unlike the Z and t tests which are known as parametric tests with 
their focus on a single parameter such as a sample mean which is then com¬ 
pared with an allegedly known value of a parameter from the population. 

For non-parametric tests, such as the Chi-squared test, the null hypothesis 
(H 0 ) and alternative hypothesis (Hj) are defined in terms of the distribution as a 
whole. Usually we test: 

H 0 : the distribution as a whole is evenly (equally) spread. 

H, : the distribution as a whole is not evenly (equally) spread. 

X 2 statistic 

The x 2 statistic or score makes use of actually observed values (0) which are 
then contrasted with the expected values (£) we should anticipate if the null 
hypothesis (H 0 ) is in fact true. It is defined as: 

X 2 = X 

f=i 

where O = observed values 

E = expected values (if null hypothesis, H 0 , is true) 
n = number of observations in the sample 

The x 2 distribution follows the pattern shown in Figure 7.16. 

Note that, as with the t distribution, the x 2 distribution actually changes as 
the sample size (n) changes. We therefore again make use of the idea of i/QT'tt." 
of freedom (v) (see p. 170), with the number of degrees of freedom expressed as 
v — n — 1, where n corresponds to the number of observations in the sample- 
Of course if the distribution as a whole is evenly spread throughout, as oui 
null hypothesis H 0 suggests, then the observed values (O) would exactly equa 
the expected values (£). It follows that (O - £) in the numerator of tire x 2 statistic 
will equal zero for every one of the n observations in the sample, and summing 
the squares of zero will still give us zero for x 2 - In other words x = vV 
correspond to a perfectly evenly spread distribution. The less evenly spread the 
distribution, the greater the discrepancy between any observation (O _ 

expected value (£), when the latter is calculated on the assumption that tv 
distribution is evenly spread (i.e. H„ is true). 


(0 - g ) 2 
E 
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Critical values 

The critical values are those values for yf outside of which we are willing 
to reject the null hypothesis (H 0 ) that the distribution is evenly spread. In this 
case we reject H 0 , where the observations (O) are sufficiently different from 
the expected values (E) we should anticipate if the distribution really is evenly 
spread. 

► As we have already noted the x 2 distribution will vary with the number of 
degrees of freedom (v), where v = n - 1. It follows that the critical values 
will therefore vary with v. 

► The level of significance (a), as before, represents the maximum risk we 
are willing to take in rejecting the null hypothesis (H 0 ) when it is in 
fact true. 

The critical values for the % 2 distribution can therefore be expressed as: 

x 2 a , v 

where a = level of significance 

v = number of degrees of freedom (n - 1) 

All tests involving the % 2 distribution are one-tailed tests. 

Appendix 6 (p. 319) gives a full table of critical values for a given level 
of significance (a) and a given number of degrees of freedom (»•). For example, 
as shown in Figure 7.17, for a 5 per cent level of significance (or = 0.05) and 
10 degrees of freedom (e = 10), we have a critical value of 18.3 for y*. In other 
words if, from our sample data, we calculate the r statistic as over 18.3, then 
we will reject the null hypothesis ll„ that the data is evenly spread. There will 
then be less than a 5 per cent chance of being wrong in deciding to reject the 
null hypothesis. 
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figure 7.17 

finding the critical 
value of x 2 


WORKED 
EXAMPLE 7.12 



Over a year a large company experiences 100 computer breakdowns. These are 
classified into the four time-period slots indicated below. 


Time of computer breakdowns 

Number of breakdowns 

9.00 a.m.-ll.OO a.m. 

22 

11.00 a.m.-l.OO p.m. 

18 

1.00 p.m.-3.00 p.m. 

26 

3.00 p.m.-5.00 p.m. 

34 


Test the hypothesis that computer breakdowns are spread evenly through the 
working day, using a 5 per cent (0.05) level of significance. 


Solution 

A/e can follow the same seven steps in testing a hypothesis as for the - •''' 
distributions. 

Step 1 H 0 : Breakdowns evenly spread 

H x : Breakdowns not evenly spread 

Step 2a- 0.05 
Step 3 v = n - 1 
= 4-1 

= 3 

2 — 7 81 

S i e p 4 Calculate the X 1 statistic from the sample data 















SAMPLING AND TESTS OF HYPOTHESES 177 


0 

E 

(O-E) 

(0 - E) 2 

(0 - E) 2 
E 

22 

25 

-3 

9 

0.36 

18 

25 

-7 

49 

1.96 

26 

25 

+1 

1 

0.04 

34 

25 

+9 

81 

3.24 





5.60 


Note: If H 0 is true, then we expect 25 breakdowns to occur in each of the four time-periods of two hours. 

X 2 = 5.60 

Step 5 5.60 < 7.81 
Step 6 Accept H 0 

Step 7 Cannot reject H 0 that computer breakdowns are evenly spread over time. 

In other words there is not enough evidence to conclude that the computer 
breakdowns that occur at different times are due to anything other than 
chance. 


Data shown in table or matrix form 

Often the data are shown, or can be represented, in table or matrix form. As we 
shall see from Worked Example 7.13 we follow exactly the same procedures as 
above, with one difference which relates to calculating the number of degrees of 

freedom (v): 

v = (r- 1) x (c - 1) 

where v = number of degrees of freedom 
r = number of rows in table 
c = number of columns in table. 

As we shall also see in Worked Example 7.13, we can devise a useful formula for 
calculating the expected values (£) when data are presented in tabular form: 

col umn total 
E = row total x tota f 

where E = expected value 


WORKED 
EXAMPLE 7.13 


f mh satisfaction was undertaken for systems analysts and computer 
k survey o j | arge fjrm a questionnaire asked 100 workers of each 

>rogrammers employ ^ satisfied with their job. The results are shown below: 
ype whether or not they were sai^M 


Type of employee 

Systems analysts 
Computer programmers 


Satisfied 

Not satisfied 

Totals 

63 

37 

100 

53 

47 

100 

116 

84 

200 


Totals 
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, . „„^hin uotween the type of employee and job 
Is there a significant relationship between in w 

satisfaction? Test at a 5 per cent level of significance. 


Solution 

This is a x 2 test since we are test ' n ^ nu ** hypothesis ^o) °f n o association 
between type of employee and job satisfaction against the alternative hypothesis 
(Hi) tha t there is such an association. We are using observations from sample 

data (200 employees) to help test H 0 . 

If ‘everything is in proportion’ and there is no association, then H 0 is true. 
We should then expect that the overall proportion of satisfied employees should 
be reflected in the totals for each type of employee claiming to be satisfied. 

We return to this idea below when working out the expected values ( E). 

Sometimes tabular data are given with row and column totals provided. If not, 
it will help if you insert such row and column totals. Note also that the sum of 
the respective row and column totals is referred to as the ‘grand total’, here the 
200 employees involved in the survey. 

We can follow our usual seven steps, but remembering to apply the 
procedures already mentioned for data in tabular form: 

Step 1 H 0 : no association between type of employees and job satisfaction 
(i.e. responses are in proportion). 

H ± : association between type of employee and job satisfaction 
(i.e. responses are not in proportion). 

Step 2 a = 0.05 
Step 3 v = (r - 1) x (c - 1) 
v = (2 - 1) x (2 - 1) 
v = 1 


(Note: when using tabular data, do not count the row total or column total when 
calculating v) 

Xa.v ~ X 0.05,1 = 3.84- 


Step 4 Calculate the X 2 statistic from the sample data. 



(0 - £) 2 
E 


where 0 = observations 

E = expected values if H 0 
(no association) is true. 

Here we can apply our formula to work out expected values. 

£ = row total x co * umn total 
grand total 


£ (Systems analysts satisfied) 


= 100 x 


119 

200 


58 


100 x 


84 

200 


42 


£ (Systems analysts not satisfied) 
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Review 

Questions 


E (Computer programmers satisfied) = 100 x M®. = 58 

200 

E (Computer programmers not satisfied) = 100 x — = 42 

200 

^* u l a simply helps us impose the overall proportions of satisfied/not 
8 is ie on the totals for systems analysts and computer programmers 

respectively. The resulting values tell us what to expect if the null hypothesis H 0 
(no association) is true. 

We can express our table of expected values (E) as: 


Expected values 

Satisfied Not satisfied 
Systems analysts 58 42 

Computer programmers 58 42 


We can now calculate x 2 , as follows: 


0 

E 

(0- E) 

(0 - E) 2 

(0 - E) 2 

E 

63 

58 

+5 

25 

25/58 = 0.431 

37 

42 

-5 

25 

25/42 = 0.596 

53 

58 

-5 

25 

25/58 = 0.431 

47 

42 

+5 

25 

25/42 = 0.596 





2.054 


f_ = 2.054 

Step 5 2.054 < 3.84 
Step 6 Accept H 0 

Step 7 No association between type of employee and job satisfaction. There is 
not enough evidence to conclude that these different types of employees 
have different views as to job satisfaction. 

Note: Review question 7.15 is a x 2 question. 


7 1 The manageress of a restaurant claims that her weekly turnover is £3,130 
with a standard deviation of £115 per week. If the turnover is normally 
distributed and a random sample of 25 weekly turnover figures is recorded, 
what is the probability that the sample mean will be: 

(a) less than £3,100? (b) greater than £3,190? (c) between £3,110 and 

£3,180? 


. , e business studies graduates, one year after graduating, is 

7.2 The mean salary deviation of £1,300. What is the probability that 

£12,500 with a stardard de ^ ^ Qf at least £13 ,000? 

a random sample of 4 ) or uic 
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7.3 A sample of 49 observations is taken from a normal population. The sample 
mean is 58, and the standard deviation of the sample is 10. Determine the 
95 per cent and the 99 per cent confidence intervals for the population mean. 

7.4 A wine shop owner wishes to estimate the mean number of bottles sold to 
account-holders per month. A sample of 60 accounts gives a mean of 8.6 
bottles per month with a standard deviation of 2.3. Develop a 99 per cent 
confidence interval for the population mean. 

7.5 The Durable Construction Company produces frames for solar panels. The 
frames need to withstand certain pressures. A sample of 45 frames is taken 
from the production line and tested for the weight they can support 
before they distort. The test results from the sample of 45 frames suggest 
an average weight of 39.8 kilos before distortion with a standard deviation 
of 6 kilos. 

(a) Construct a 99 per cent confidence interval for the population mean. 

(b) Construct a 90 per cent confidence interval for the population mean. 

(c) How large should the sample be if we want a 99 per cent confidence 
interval for the population mean with a margin of error of 2 kilos? 


7.6 Transport economist, Sisson, wishes to estimate how often residents of 

Cambridge use the city bus system per month. 

(a) How many residents must be surveyed if he wants to develop a 95 per 
cent confidence interval with a margin of error of three trips per 
month? A previous pilot study gave the standard deviation as 14 trips. 

(b) If Sisson randomly selects 160 residents and obtains a mean of 22.4 
trips with a standard deviation of 15, construct a 95 per cent 
confidence interval for the population mean. 


7.7 The owner of a restaurant believes that the average amount spent on a 
meal per couple is £50.00. She takes a random sample of 40 couples and 
finds that the mean expenditure is £53.70 with a standard deviation of 
£7.80. Test the owner's claim at the 5 per cent level of significance. 


7.8 A car manufacturer states that its new model does 32.5 mpg. A recent 
independent study of 50 of the new cars selected randomly showed a mean 
mpg of only 30.4 with a standard deviation of 5.3 mpg. Test the view that 
the manufacturer's claim is too high using a 1 per cent level of significance. 


7.9 


The expected life of a car engine is 90,000 miles. A random sample of l9() 
cars gives a mean life of 96,700 miles with a standard deviation of 37,500 
miles. Test whether the cars are lasting longer than expected at the 1 P el 
cent level of significance. 


7.10 An estate agent advertises that it can sell houses in 40 days or less- ■ 
random sample of 64 recently sold homes shows a sample mean sell' 1 ^ 
time of 45 days with a standard deviation of 20 days. Using the 5 p er 
level of significance, test the validity of the agent's claim against a riV ‘ 
claim that this is untrue. 
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* •11 ^ poll of 121 13 ri 

CDs they buy a year is 7 01 w Fh U< ^ entS revea ^ s ^ at the mean number of 
claim that the mean for all <t, h 3 Standard deviation of 3.74. Test the 
level of significance. d S1S less than 7 5 CDs at the 1 per cent 

7*12 A machine ic cnf f_ 

operator that the rodTaTeTootn ThT* 1 V™ CmS ' " ls fel * ^ the 
1 per cent significance WpI f , The hypothesis is to be tested at the 
8.7, 8.4, 8.6 8.3 8 S a h « c !° m the followin g sample of rods: 9.0, 8.5, 
of the sample? ' -hat conclusion can be drawn on the basis 

13 mOTe^au^ckL^ttT daims that new Procedures enable it to dispatch orders 
delav wac 9 rwi 30 previous ty- Records show that the previous average 
ays ' ^ random sample of 17 orders using the new procedure 
owe a elay time of 18 days with a sample standard deviation of 2.5 

ays. s the claim made for the new procedures supported at the 5 per cent 
level? 


7 *^ ^ recruitment agency claims that four bedroom detached houses near the 
Cambridge Science Park can be rented for £900 per month on average. A 
random sample of 20 houses gave a mean rental of £1,050 per month for 
this type of property with a sample standard deviation of £290 per month. 
Test the agency's claim against the claim that rentals are higher at the 10 
per cent significance level. 

7.15 Using the following observations recorded by a researcher, check if the 
assumption of no relation between lung cancer and smoking can be ac¬ 
cepted at the 5 per cent level of significance. 


Observation table 



Smokers 

Non-smokers 

Totals 

Cancer sufferers 
Non sufferers 
Totals 

200 

150 

350 

200 

450 

650 

400 

600 

1,000 


Answers to Review questions can be found on p. 425 


Further study and data 


Texts 

ii- r H993), Quantitative methods for accounting ami 
Bancroft, G. and O Su w ' H1U chapters 15-18. 

business studies, .iru «• - Quantitative methods for business decisions, 4th 

.. ~ 






